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US provisional application No. 60/733,669, filed November 4, 2005; US patent 
application No. 11/193,750, filed July 29, 2005; and US patent application No. 
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10 

Field of the Invention 

The present invention relates to the field of biotechnology, and more 
specifically to the field of genome modification. Disclosed herein are compositions 
including chromosomes and vectors, and methods of use thereof, for the generation of 
15 genetically transformed cells and animals including avians. 

Background 

Transgenic technology to convert animals into "bioreactors" for the production 
20 of specific proteins or other substances of pharmaceutical interest (Gordon et al, 1987, 
Biotechnology 5: 1183-1187; Wilmut et al, 1990, Theriogenology 33: 113-123) offers 
significant advantages over more conventional methods of protein production by gene 
expression. For example, recombinant nucleic acid molecules have been engineered 
and incorporated into transgenic animals so that an expressed heterologous protein 
25 may be joined lo a protein or peptide that allows secretion of the transgenic expression 
product into milk or urine, from which the protein may then be recovered. 

Another system useful for heterologous protein production is the avian 
reproductive system. The production of an avian egg begins with formation of a large 
yoUc in the ovary of the hen. The unfertiUzed oocyte or ovum is positioned on top of 
30 the yolk sac. After ovulation the ovum passes into the infundibulum of the oviduct 
where it is fertiUzed, if sperm are present, and then moves into the magnum of the 



1 



wo 2006/093847 



PCT/US2006/006752 



oviduct which is hned with tubular gland cells. These cells secrete the egg-white 
proteins, including ovalbumin, lysozyme, ovomucoid, conalbumin and ovomucin into 
the lumen of the magnum where they are deposited onto the avian embryo and yolk. 
The hen oviduct offers outstanding potential as a protein bioreactor because of the 
5 high levels of protein production, the promise of proper folding and post-translation 
modification of the target protein, the ease of product recovery, and the relatively short 
developmental period of chickens. 

One method for creating permanent genomic modification of a eukaryotic cell 
is to integrate an introduced DNA into an existing chromosome. Retroviruses have so 

10 far proven to be the method of choice for efficient integration. However, retroviral 
integration is directed to a number of insertion sites within the recipient genome so 
that positional variation in heterologous gene expression can be evident. 
Unpredictability as to which insertion site is targeted introduces an undesirable lack of 
control over the procedure. An additional limitation of the use of retroviruses is that 

15 the size of the nucleic acid molecule encoding the virus and heterologous sequences 
may be limited to about 8 kb. In addition, retroviruses may include undesirable 
features such as splice sites. Although wild-type adeno-associated virus (AAV) often 
integrates at a specific region in the human genome, replication deficient vectors 
derived from AAV do not integrate site-specifically possibly due to the deletion of the 

20 toxic rep gene. In addition, homologous recombination produces site-specific 
integration, but the frequency of such integration usually is typically low. 

An alternative method for delivering a heterologous nucleic acid into the 
genome is the use of one or more site-specific enzymes that can catalyze the insertion 
of nucleic acids into chromosomes. These enzymes recognize relatively short unique 

25 nucleic acid sequences that serve for both recognition and recombination. Examples 
include Cre (Sternberg & Hamilton, 1981, J. Mol. Biol. 150: 467-486, 1981), FIp 
(Broach et al, 1982, Cell 29: 227-234, 1982) and R (Matsuzaki et al, 1990, J. Bact. 
172: 610-618, 1990). 
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A novel class of phage integrases, that includes the integrase from the phage 
phiC3 1, can mediate highly efficient integration of transgenes in mammalian cells both 
in vitro and in vivo (Thyagarajan et al, Mol. Cell Biol. 21: 3926-3934, 2001). 
Constructs and methods of using recombinase to integrate heterologous DNA into a 
5 plant, insect or mammalian genome are described by Calos in U.S. Patent Serial No. 
6,632,672, the disclosure of which is incorporated in its entirety herein by reference. 

i'hc phiC3 1 integrase is a member of a subclass of integrases, termed serine 
recombinases, that include, for example, R4 and TP901-1. Unlike the phage lambda 
integrases, which belong to a tyrosine class of recombinases, the serine integrases do 

10 not require cof actors such as integration host factor. The phiC31 integrase normally 
mediates integration of the phiC3 1 bacteriophage into the genome of Streptomyces via 
recombination between the attP recognition sequence of the phage genome and the 
atlB recognition sequence within the bacterial genome. When a plasmid is equipped 
with a single altB site, phiCS 1 inlegrase will detect and mediate crossover bclwccn the 

15 attB site and a pseudo-attP site within the mammalian genome. Such pscudo-attP 
integration sites have now been identified in the mouse and human genomes. If the 
heterologous DNA is in a circular or supercoiled form, the entire plasmid becomes 
integrated with attL and attR arms flanking the nucleic acid insert. 

Integration mediated by certain integrases, such as PhiC3 I integrase-mediated 

20 integration, results in the alteration of the recognition or recombination sites 
themselves so that the integration reaction is irreversible. This will bypass the primary 
concern inherent with other recombinases, i.e., the reversibility of the integration 
reaction and excision of the inserted DNA. 

Another method for the stable introduction of heterologous nucleic acid (e.g., 

25 large heterologous nucleic acids) into a genome is by the use of an artificial 
chromosome. Artificial chromosomes for expression of heterologous genes in yeast 
are available, but artificial chromosomes being delivered to avians has not previously 
been achieved. 
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Therefore, it is an object of the invention to produce transgenic animals with 
large nucleic acid segments integrated into their genome and to provide avians which 
include an artificial chromosome in their genome. 

In one useful embodiment, the transgenic avians of the invention can be used 
5 to produce polyclonal antibodies to antigens of interest for therapeutic applications. 
Fully human polyclonal antibodies have proven to be effective therapeutics, and in 
certain circumstances may be more efficacious than monoclonal antibodies. 
Polyclonal antibodies, opposed to monoclonal antibodies, are of particular therapeutic 
value for use against antigenic targets that are either complex in nature, subject to 

10 resistance via mutational escape or arc highly polymorphic. For example, toxins can 
require multiple antibodies for effective neutralization. Also, pathogenic virus and 
bacteria, which can quickly mutate into resistant strains, are targets for polyclonal 
antibodies. In addition, polyclonal antibodies can be used as a masking therapeutic 
agent. For example, the polyclonal antibodies may be used in Rh disease therapy and 

15 immunosuppressive regimens associated transplant rejection and autoimmune disease. 

At present, there are approximately 20 therapeutic polyclonals on the market. 
Existing polyclonal therapeutics are derived either from animal or human serum which 
imposes certain drawbacks. For example, polyclonal antibodies have a limited in vivo 
half-Ufc. In addition, these polyclonals usually cannot be re-administered to a patient 

20 due to immune reaction, hi addition, human serum derived antibodies, while fully 
human, have both inherent production limitations as well as certain bio-safety 
concerns. 

Although human polyclonal antibodies have been produced in transgenic mice 
and cattle (reviewed in, for example, Bruggemann (2004) in Molecular Biology of B 

25 cells pp 547-561 . Academic Press and Kuroiwa et al (2002) Nature Biotechnol 20:889- 
894), there are certain limitations to each of these platforms with respect to large-scale 
manufacture of therapeutic polyclonals. For example, the levels of antibody 
production achievable in mice is extremely small by virtue of their body size, hi 
cattle, the endogenous immunoglobulin genes are not "knocked-out", since embryonic 

30 stem cell lines, necessary for knock-out procedures do not exist. Therefore, 
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contaminating bovine immunoglobulins wiU be present which wiU be difficult to 
separate from human antibodies by standard protein A/G affinity purification 
procedures. In addition, since the antibodies are produced in animal serum, there are 
biosafety and serum protein contamination problems. 
5 In order to fully realize the potential of therapeutic polyclonals, a production 

platform is needed that can efficiently produce large quantities of fully human 
polyclonal antibody. 

Transgenic chickens, which express fully human polyclonal antibodies in 
response to antigenic stimulation and deposit the antibodies into their eggs, would 

10 present such an ideal production system. For example, a single hen has a production 
capacity of over 250 eggs/year and about 50 to about 100 mg of chicken IgG (also 
termed IgY) is naturally transported into each egg produced. 

The present invention is also directed to methods of producing artificial 
chromosomes which contain large nucleic acid inserts, such as Ig loci. Producing 

15 artificial chromosomes containing a transgcnc by integrating the transgcnc into the 
chromosome can have certain limitations. For example, in some integration 
methodologies the transgene can integrate into any of the available chromosomes 
within the cell, including the host ceUs chromosomes. In certain instances 
homologous recombination, can overcome this problem. However, homologous 

20 recombination has a number of limitations including the requirement that the transgene 
be specifically engineered for the procedure. In certain useful site specific integration 
methodologies, the transfected nucleotide sequence must be circular, otherwise 
integration will introduce a double-stranded break into the artificial chromosome. To 
by-pass the need for a circular insert the vector can be equipped with two RRSs that 

25 flank the desired transgene. However, two recombinases would be required for the 
integration event and the artificial chromosome would also harbor two recombination 
sites. The complexity involved in this type of integration would result in an overall 
low rate of integration. Regardless of the integration methodology employed, the 
efficiency of integration for large transgenes is typically very much reduced relative to 

30 the integration of smaller transgenes, (e.g., up to 1000 fold reduction in efficiency for 
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transgenes over 80 kb (kilobases) relative to smaller transgenes, for example, less than 
10 kb). This may be due to certain factors such as the large size of the transgene 
lowering the rate of transfection. In addition, large transgenes can be susceptible to 
nicking and breaking due to shear forces and/or nuclease degradation. 
5 One potential difficulty in the use of artificial chromosomes in the production 

of transchromosomic animals such as avians can be difficulty in preparing a 
sufficiently homogeneous mixture of artificial chromosomes. Fluorescent synthetic 
polyamide probes have been used to obtain human chromosomes from their native 
environment by tagging repeated sequences in the chromosome with labeled 

10 polyamides (see, for example, Gygi et al. (2002) Use of fluorescent sequence- specific 
polyamides to discriminate human chromosomes by microscopy and flow cytometry. 
Nucleic Acids Res 30: 2790-9). 

Purification of artificial chromosomes by methods such as flow cytometry can 
be limited to only metaphase chromosomes, for example, larger than 60Mb in size. In 

15 certain instances artificial chromosomes which cannot be purified using conventional 
technologies (e.g., artilicial chromosomes less than about 60Mb is size) could be 
useful for the production of transchromosomic animals including transchromosomic 
avians. 

What is needed are methods which provide for the efficient introduction of 
20 artificial chromosomes into animal genomes such as avian genomes. 

Summary of the Invention 

One useful aspect of the invention relates to methods of producing 
transchromosomic avians. In one embodiment, the methods include substantially 

25 purifying a chromosome followed by introducing the purified chromosome into an 
avian embryo and thereafter maintaining the embryo under conditions suitable for the 
embryo to develop and hatch as a chick. In one embodiment, the methods include 
inserting a heterologous nucleotide sequence into the chromosome before or after 
substantially purifying the chromosome. In one embodiment, the chromosome is 

30 introduced into the avian embryo by microinjection; however, any useful method to 
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introduce the chromosome into the avian embryo is within the scope of the present 
invention. 

It is contemplated that the chromosome may be introduced into the embryo by 
delivering the chromosome to an avian cell before or after fertilization. For example, 
5 the chromosome may be introduced into an ovum or a sperm before fertilization. In 
another example, the chromosome is introduced into a cell of an embryo (e.g., stage I 
to stage XTI embryo). In one embodiment, the chromosome is introduced into an early 
stage embryo, for example, and without limitation, a stage I embryo. In one 
embodiment, the chromosome is introduced into a germinal disc. 

10 The methods provide for the introduction of any useful number of 

chromosomes into the avian embryo in order to produce a transchromosomal avian. 
For example, and without limitation, between 1 and about 10,000 chromosomes may 
be introduced into the embryo. In another example, between 1 and about 1,000 
chromosomes may be introduced into the embryo. 

15 The invention also provides for transchromosomal avian cells wherein the 

artificial chromosome includes a nucleotide sequence which encodes a therapeutic 
substance. The cells may be isolated from transchromosomal avians and thereafter 
grown in culture. The invention also contemplates the production of the 
transchromosomic avian cells by stable introduction of the artificial chromosome into 

20 cultured avian cells. Any useful method may be employed for the introduction of the 
artificial chromosome into the cultured cells including, without limitation, lipofection 
or microinjection. 

The invention also contemplates methods which include isolating an 
artificial chromosome; introducing the artificial chromosome into an avian embryo; 
25 maintaining the embryo under conditions suitable for the embryo to develop and hatch 
as a chick; and maintaining the chick under conditions suitable to obtain a mature 
avian wherein the artificial chromosome is present in the genome of the mature avian. 

In one aspect, the invention relates to methods which include isolating an 
artificial chromosomes by flow cytometry. The flow cytometry may be facilitated by a 
30 probe which is associated with the artificial chromosome. For example, the probe may 
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be a polyamide probe. In one embodiment, the probe (e.g., polyamide probe) may 

include a fluorescent molecule or tag. 

In one embodiment, the artificial chromosome is present in micronuclei. For 

example, the artificial chromosome may be present in micronuclei prior to flow 
5 cytometry purification of the artificial chi-omosome. In one embodiment, the 

micronuclcar environment protects the artificial chromosome from degradation or 

fragmentation that may occur before during or after introduction (e.g., by injection or 

lipofection) of the artificial chromosome into the avian embryo. In one embodiment, 

the micronuclei contain diploid mitotic artificial chromosomes. Production of 
1 0 micronuclei can be accomplished by any useful method known in the art, for example, 

as disclosed in Labidi et al (1987) Experimental Cell Research 617-627, the disclosure 

of which is incorporated in its entirety herein by reference. 

Typically the methods include transferring the embryo, into which the artificial 

chromosome has been introduced, into a recipient female avian. In one embodiment, 
15 the artificial chromosome is an early stage embryo such as a stage 1, stage 11, stage 111, 

stage IV, stage V or Stage VI embryo. In one useful embodiment, the embryo is a 

stage I embryo. 

In one useful embodiment, the artificial chromosome comprises one or more 
heterologous recombination sites, for example, between 1 and about 100 

20 recombination sites may be employed. T5rpically, the chromosome will include a 
heterologous coding sequence. In one useful embodiment, the heterologous coding 
sequence consists of or contains a pharmaceutical protein coding sequence. Any 
useful pharmaceutical protein coding sequence may be employed, such as those 
disclosed elsewhere herein. In addition, the artificial chromosome can include a 

25 promoter, for example, and without limitation, a promoter which functions in tubular 
gland cells. For example, the promoter may be linked to a pharmaceutical protein 
coding sequence such that the promoter initiates transcription of the heterologous 
coding sequence (i.e., the promoter is operably linked to the heterologous coding 
sequence). The invention also contemplates the inclusion of an IRES (internal 

30 ribosome entry site) in the artificial chromosome. 
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In one aspect, the invention provides for transgenic avians which produce eggs 
containing polyclonal antibodies, for example, human polyclonal antibodies. The 
invention also relates to the eggs produced by such an avian. The avians employed in 
the invention may be any useful avians, such as those avians disclosed elsewhere 
5 herein, for example chickens, quail and turkeys. The invention contemplates the 
production of chimeric birds and germline transgenic birds including Gl and Ci2 
transgenic or transchromosomic avians which produce polyclonal antibodies. 

In one useful embodiment of the invention, one or more cells of the transgenic 
avian contain an artificial chromosome which has coding sequences for a polyclonal 

10 antibody. Any useful artificial chromosome may be employed such as those having a 
centromere selected from the group consisting of an insect centromere, a mammalian 
centromere and an avian centromere. In one specific embodiment, the artificial 
chromosome is a satellite artificial chromosome. 

The invention also provides for methods of producing artificial chromosomes 

15 in cells. In one aspect, methods of the invention include introducing one or more 
transgenes into an artificial chromosome during assembly of the artificial 
chromosome. In one useful embodiment, the transgenes contain at least one of a 
promoter and a coding sequence for a therapeutic protein. In one embodiment, the 
coding sequence encodes one or more Ig loci such as IgA,, IgKg IgH, or portions thereof 

20 or combinations thereof in its germline. The methods for producing artificial 
chromosomes containing a transgene are particularly useful for the introduction of 
large transgenes into the chromosome such as portions of Ig genes, for example, 
portions of human Ig genes (e.g., an IgA, gene, an Ig H gene and/or an IgK gene). 
Certain references which include disclosure that can be useful in certain aspects of the 

25 invention include Csonka, et al (2000) Journal of Cell Science 113: 3207-3216 and 
Nicholson, et al (1999) J. Immunology 163(12):6898-6906. The disclosures of each of 
these two journal articles are incorporated in their entirety herein by reference. 

Integration of a transgene into a defined chromosomal site is useful to improve 
the predictability of expression of the transgene, which is particularly advantageous 

30 when creating transgenic vertebrate animals such as, transgenic avians. Transgenesis 
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by methods that randomly insert a transgene into a genome are often inefficient since 
the transgene may not be expressed at the desired levels or in desired tissues. 

The present invention relates to methods of modifying the genome of 
vertebrate cells (e.g., production of transgenic vertebrates, in particular, transgenic 
5 avians) and to such cells with modified genomes and their progeny. In one 
embodiment, the methods provide for introducing into vertebrate cells a first 
recombination site such that the recombination site is inserted into the vertebrate cell 
genome. Typically, in such embodiments, the genome does not normally include this 
first recombination site prior to the recombination site introduction. Methods of the 

10 invention may also include introducing a nucleotide sequence comprising a second 
recombination site and a sequence of interest such as a coding sequence into the 
vertebrate cell or progeny of the vertebrate cell. The nucleotide sequence comprising 
the second recombination site and the sequence of interest such as a coding sequence 
may be introduced into the vertebrate cell before, at about the same time as or after the 

15 introduction of the first recombination site. Additionally, the present methods may 
include introducing into the vertebrate cell or progeny cell thereof a substance which 
facilitates insertion of the nucleotide sequence comprising the second recombination 
site and the sequence of interest proximal to the first recombination site. For example, 
the nucleotide sequence comprising the second recombination site and the sequence of 

20 interest may be inserted adjacent to or intemaUy in the first recombination site. In one 
very useful embodiment, the first recombination site and/or the nucleotide sequence 
comprising the second recombination site and the sequence of interest are stably 
incorporated into the genome of the cell. 

The present invention contemplates the genomic modification of any useful 

25 vertebrate cells including, but not limited to, avian cells. Examples of cells which may 
have their genomes modified in accordance with the present invention include, without 
limitation, reproductive cells including sperm, ova and embryo cells and 
nonreproductive cells such as tubular gland cells. 

The present invention also relates to methods of producing transgenic 

30 vertebrate animals and to the transgenic animals produced by the methods and to their 
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transgenic progeny or descendents. The invention also includes the transgenic cells 
included in or produced by the transgenic vertebrate animals. Examples of such cells 
include, without limitation, germline cells, ova, sperm cells and protein producing 
cells such as tubular gland cells. In one useful embodiment, the transgenic vertebrate 
5 animals of the invention are transgenic avians. Transgenic avians of the invention may 
include, without limitation, chickens, turkeys, ducks, gccsc, quail, pheasants, parrots, 
finches, hawks, crows or ratites including ostrich, emu or cassowary. 

In accordance with the present invention, methods of producing transgenic 
vertebrate animals can include introducing into an embryo of a vertebrate animal a first 

10 recombination site such that the recombination site is present in sperm or ova of a 
mature vertebrate animal developed from the embryo. In one useful embodiment, the 
embryo does not normally include the first recombination site in its genome prior to 
the recombination site introduction. The methods may also include introducing a 
nucleotide sequence comprising a second recombination site and a sequence of interest 

15 such as a coding sequence into the embryo of the vertebrate animal. The first 
recombination site and/or the nucleotide seqiience comprising the second 
recombination site and a sequence of interest may be introduced into the embryo of the 
vertebrate animal before the embryo is fertihzed (i.e., when an ovum), at about the 
same time as introduction of the sperm into the ovum or after fertilization. 

20 The methods can also include introducing the nucleotide sequence comprising 

a second recombination site and a sequence of interest into an ovum or a sperm of a 
mature vertebrate animal developed from the embryo (or its descendents) into which 
the first recombination site was introduced. In one embodiment, the nucleotide 
sequence comprising a second recombination site and a sequence of interest is 

25 introduced into the ovum from the mature vertebrate animal before the ovum is 
fertihzed. In another embodiment, the nucleotide sequence comprising a second 
recombination site and a sequence of interest is introduced into the ovum at about the 
time of fertilization. In one particularly useful embodiment, the nucleotide sequence 
comprising a second recombination site and a sequence of interest is introduced into 

30 the ovum after the ovum is fertihzed (when an embryo). 



11 



wo 2006/093847 



PCT/US2006/006752 



The methods may include, upon addition of the nucleotide sequence 
comprising a second recombination site and a sequence of interest to an embryo, ovum 
or sperm, introducing into the embryo, ovum or sperm, a substance which facilitates 
insertion of the nucleotide sequence comprising the second recombination site and the 
5 sequence of interest proximal to the first recombination site. For example, the 
nucleotide sequence comprising the second recombination site and the sequence of 
interest may be inserted adjacent to or internally in the first recombination site. In one 
useful embodiment, the methods include introducing into an embryo comprising the 
first recombination site in its genome, a substance which facilitates insertion of the 

10 nucleotide sequence comprising the second recombination site and the sequence of 
interest proximal to the first recombination site. 

In one useful embodiment, these methods include fertilizing an ovum with 
sperm comprising the first recombination site. The methods can include also 
introducing into the ovum a nucleotide sequence comprising a second recombination 

15 site and a sequence of interest such as a coding sequence and a substance which 
facilitates insertion of the nucleotide sequence comprising the second recombination 
site and sequence of interest proximal to (e.g., adjacent to or internally in) the first 
recombination site. It is contemplated that the nucleotide sequence comprising a 
second recombination site and a sequence of interest may be introduced into the ovum 

20 before or after fertilization by the sperm or at about the same time as fertilization. 

In one very useful embodiment of the methods disclosed herein, the nucleotide 
sequence comprising the second recombination site and the sequence of interest is 
stably incorporated into the genome of the embryo, ovum or sperm. 

The methods disclosed herein typically eventually include exposing a fertilized 

25 ovum to conditions which lead to the development of a viable transgenic vertebrate 
animal. 

In one embodiment, the nucleotide sequence of interest includes an expression 
cassette. Optionally, the nucleotide sequence of interest may include a marker such as, 
but not limited to, a puromycin resistance gene, a luciferase gene, EGFP-encoding 
30 gene, and the like. 
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Typically, in accordance with methods known in the art or methods disclosed 
herein, the embryo of the vertebrate animal or fertilized ovum of a mature vertebrate 
animal of the invention is exposed to conditions which lead to the development of a 
viable transgenic vertebrate animal. 
5 Embryos that are useful in the present methods include, without limitation, 

stage 1, stage II, stage III, stage IV, stage V, stage VI, stage VII, stage VIII, stage IX, 
stage X, stage XI and stage XII embryos. 

In one embodiment, the nucleotide sequence included with the second 
recombination site of interest is a coding sequence. The nucleotide sequence of 

10 interest included with the second recombination site can be of any useful size. For 
example, and without limitation, the nucleotide sequence of interest may be from 
about 0.1 kb to about 10 mb, for example, about 1 kb to about 1 mb. In one 
embodiment, the nucleotide sequence of interest is about 5 kb to about 5 mb in size, 
for example, about 5 kb to about 2 mb, e.g., about 8 kb to about 1 mb. In one 

15 embodiment, the nucleotide sequence of interest is about 0.5 kb to about 500 kb. 

The first recombination site and/or the nucleotide sequence which includes the 
second recombination site and a sequence of interest such as a coding sequence may 
be introduced into cells, embryos (i.e., fertilized ova) or sperm by any useful method. 
These useful methods include, without limitation, ccU fusion, lipofection, transfcction, 

20 microinjection, calcium phosphate co-precipitation, electroporation, protoplast fusion, 
particle bombardment and the like. In addition, the first recombination site or 
nucleotide sequence comprising the second recombination site and the sequence of 
interest may be introduced into cells, embryos, ova or spenn in the presence of a 
cationic polymer such as PEI and/or other substances disclosed elsewhere herein or 

25 known in the art. 

In one embodiment, recombination sites employed in the present invention are 
isolated from bacteriophage and/or bacteria. For example, the recombination sites 
may be attP sites or attB sites. 

The substance which facilitates insertion of the second recombination site and 

30 a sequence of interest may be an enzyme. In one embodiment, the substance is a site 
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specific recombinase. In one useful embodiment, the substance which faciUtates 
insertion of the nucleotide sequence is nucleic acid, for example, DNA or RNA. The 
DNA or RNA may include modified nucleosides as described elsewhere herein or are 
known to those of skill in the art. In one embodiment, modified nucleosides are 
5 employed to extend the half-life of RNA or DNA molecules employed in the present 
invention. For example, it may be desirable to extend the half life of the RNA or DNA 
molecules in the presence of a cellular environment. In one useful embodiment, the 
nucleic acid encodes an enzyme such as a site specific recombinase. 

NonUmiting examples of site specific recombinases which may be employed 

10 herein either as protein or encoded by nucleic acid include serine recombinases and 
tyrosine recombinases. Examples of serine recombinases which may be employed 
include, without limitation, EcoYBCK, 4>C31, SCH10.38c, SCC88.14, SC8F4.15c, 
SCD12A.23, Bxbl, WwK, Sau CcrB, Bsu CisB, TP901-1, <I>370.1, ^105, <3>FC1, 
A118, Cacl956, Cacl951, Sau CcrA, Spn, TnpX, indX, SPBc2, SC3C8.24, 

15 SC2E1 .37, SCD78.04C, R4, <l>Rvl , Y4bA and Bja serine recombinases. 

In one embodiment of the invention, the present methods include introducing 
an integration host factor into a ceU (e.g., an embryo) to faciUtate genomic integration. 
Such integration host factors may be particularly useful when employing certain 
substances such as tyrosine recombinases as disclosed herein. 

20 The nucleotide sequence of interest may include a coding sequence. The 

coding sequence may encode any useful protein. In one useful embodiment, the 
sequence of interest encodes a pharmaceutical or therapeutic substance. The invention 
contemplates the production of any useful protein based pharmaceutical or therapeutic 
substance. Examples of pharmaceutical or therapeutic substances include without 

25 limitation at least one of a light chain or a heavy chain of an antibody (e.g., a human 
antibody) or a cytokine. In one embodiment, the pharmaceutical or therapeutic 
composition is interferon, erythropoietin, or granulocytc-colony stimulating factor, hi 
one embodiment, the transgenic animal is an avian and the sequence of interest 
encodes a polypeptide present in eggs produced by the avian. 
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In one embodiment, integrases such as phage integrases, for example, serine 
recombinases, such as the integrase from phage phi(^31, can mediate the efficient 
integration of transgenes into target cells both in vitro and in vivo. In one 
embodiment, when a plasmid is equipped with a single attB site, the integrase detects 
5 attP homologous sequences, termed pseudo-attP sites, in a target genome and mediates 
crossover between the attB site and apseudo attP site. 

In one embodiment, once delivered to a recipient cell, for example, an avian 
cell, the phiC31 integrase mediates recombination between the att site within the 
nucleic acid molecule and a bacteriophage attachment site within the genomic DNA of 

10 the cell. Both att sites are disrupted and the nucleic acid molecule, with partial att 
sequences at each end, is stably integrated into the genome attP site. The phiC31 
integrase, by disrupting the att sites of the incoming nucleic acid and of the recipient 
site within the cell genome can preclude any subsequent reverse recombination event 
that would excise the integrated nucleic acid and reduce the overall efficiency of stable 

15 incorporation of the heterologous nucleic acid. 

Following delivery of the nucleic acid molecule and a source of integrase 
activity into a cell population and integrase-mediated recombination, the cells may be 
returned to an embryo, hi the case of avians, late stage blastodermal cells may be 
returned to a hard shell egg, which is resealed for incubation until hatching. Stage 1 

20 embryos may be directly microinjected with the polynucleotide and source of 
integrase activity, isolated, transfected and returned to a stage I embryo which is 
reimplanted into a hen for further development. Additionally, the transfected cells 
may be maintained in culture in vitro. 

The present invention provides novel methods and recombinant polynucleotide 

25 molecules for transfecting and integrating a heterologous nucleic acid molecule into 
the genome of a ceU of a vertebrate animal, such as an avian. Certain methods of the 
invention provide for the delivery to a cell population a first nucleic acid molecule that 
comprises a region encoding a recombination site, such as a bacterial recombination 
site or a bacteriophage recombination site. In one embodiment, a source of integrase 

30 activity is also delivered to the cell and can be in the form of an integrase-encoding 
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nucleic acid sequence and its associated promoter or as a region of a second nucleic 
acid molecule that may be co-delivered willi the polynucleotide molecule. 
Alternatively, integrase protein itself can be delivered directly to the target cell. 

The recombinant nucleic acid molecules of the present invention may further 
5 comprise a heterologous nucleotide sequence operably linked to a promoter so that the 
heterologous nucleotide sequence, when integrated into the genomic DNA of a 
recipient cell, can be expressed to yield a desired polypeptide. The nucleic acid 
molecule may also include a second transcription initiation site, such as an internal 
ribosome entry site (IRES), operably linked to a second heterologous polypeptide- 

10 encoding region desired to be expressed with the first polypeptide in the same cell. 

The present invention provides modified isolated artificial chromosomes useful 
as vectors to shuttle transgenes or gene clusters into a genome of an avian. By 
delivery of the modified chromosome to a recipient cell, the target cell, and progeny 
thereof, become trisomic or transchromosomic. The additional chromosome will 

1 5 typically not affect the subsequent development of Ihc recipient cell and/or embryo, 
nor interfere with the reproductive capacity of an adult bird developed from such cells 
or embryos. The chromosome wiU also be stable within the genome of the cells of the 
adult bird or within isolated avian cells. The invention provides methods to isolate a 
population of chromosomes for delivery into embryos or early cells of avians, for 

20 example, chickens. 

The methods can include inserting a lac-operator sequence into an isolated 
chromosome and, optionally, inserting a desired transgene sequence within the same 
chromosome. The lac operator region is typically a concatamer of a plurality of lac 
operators for the binding of multiple lac repressor molecules. A recombinant DNA 

25 molecule is constracted that includes an identified region of the taiget chiomosome, a 
recombination site such as attB or attP, and the lac-operator concatamer. The 
recombinant molecule is delivered to an avian cell, and homologous recombination 
wiU integrate the heterologous polynucleotide and the lac-operator concatamer into the 
targeted chromosome. A tag-polypeptide, such as the GPF-lac-repressor fusion 

30 protein, binds to the lac-operator sequence for identification and isolation of the 
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genetically modified chromosome. The tagged mitotic chromosome can be isolated 
using, for instance, flow cytometry. 

Among other things, the present invention relates to transchromosomic avians. 
In a particular aspect, the invention provides for GO founder transchromosomic avians 
5 (e.g., chimeic including, but not limited to, germline chimeric transchromosomic 
avians) which can give rise to germline transchromosomic offspring, for example, Gl 
and G2 germline transchromosomic offspring. 

Examples of avians which are contemplated for use herein include, without 
limitation, chicken, turkey, duck, goose, quail, pheasants, parrots, finches, hawks, 

10 crows andratites including ostrich, emu and cassowary. 

In one useful aspect, the artificial chromosome employed herein includes a 
centromere. Any useful centromere may be employed in the present invention 
including, without limitation, centromeres from insects, mammals or avians. 

In one particularly useful embodiment, the artificial chromosomes used herein 

15 include a heterologous nucleotide sequence. The nucleotide sequence may he 
heterologous to the avian and/or heterologous to the artificial chromosome. In one 
useful embodiment, the heterologous nucleotide sequence includes a coding sequence 
for a therapeutic substance. In addition, the heterologous nucleotide sequence may 
include a gene expression controlling region. Any useful gene expression controlling 

20 region may be employed in the invention. For example, and without limitation, the 
gene expression controlling region may include a lysozyme promoter, an ovomucin 
promoter, a conalbumin promoter, an ovomucoid promoter and/or an ovalbumin 
promoter or functional portions thereof. See, for example, US Patent Application No. 
10/1 14,739, filed April 1, 2002; US Patent Apphcation No. 10/856,218, filed May 28, 

25 2004 and US Patent Application No. 10/733,042, filed December 11, 2003. The 
disclosure of each of these patent applications Is incorporated herein by reference in its 
entirety. In one useful embodiment, the product of the heterologous nucleotide 
sequence (e.g., therapeutic substance) is delivered to the avian egg (e.g., the egg white) 
during production of the egg in the avian. The invention also includes the eggs 
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produced by the avians produced by these methods and other methods disclosed 
herein. 

Another aspect of the present invention is a cell, for example, an avian cell, 
genetically modified with a transgene vector by the methods of the invention. For 
5 example, in one embodiment, the transformed cell can be a chicken early stage 
blastodermal cell or a genetically transformed cell line, including a sustainable cell 
line. The transfected cell may comprise a transgene stably integrated into the nuclear 
genome of the recipient cell, thereby replicating with the cell so that each progeny cell 
receives a copy of the transfected nucleic acid. One useful cell line for the dchvcry 

10 and integration of a transgene comprises a heterologous attP site that can increase the 
efficiency of integration of a polynucleotide by an integrase, such as phiC3 1 integrase 
and, optionally, a region for expressing the integrase. 

Another aspect of the present invention is methods of expressing a 
heterologous polypeptide in a cell by stably transfecting a cell by using site-specific 

15 integrase-mediation and a recombinanl nucleic acid molecule, as described above, and 
culturing the transfected cell under conditions suitable for expression of the 
heterologous polypeptide under the control of a transcriptional regulatory region. 

Yet another aspect of the present invention concerns transgenic vertebrate 
animals, such as birds, for example chickens, comprising a recombinant nucleic acid 

20 molecule and which may (though optionally) express a heterologous gene in one or 
more cells in the animal. For example, in the case of avians, embodiments of the 
methods for the production of a heterologous polypeptide by the avian tissue involve 
providing a suitable vector and introducing the vector into embryonic blastodermal 
cells containing an allP site together with an integrase, for example, a serine 

25 recombinase such as phiC3 1 integrase, so that the vector can integrate into the avian 
genome at the attP site which has been engineered into the cell genome. A subsequent 
step may involve deriving a mature transgenic avian from the transgenic blastodermal 
cells by transferring the transgenic blastodermal cells to an embryo, such as a stage X 
embryo (e.g., an irradiated stage X embryo), and allowing that embryo to develop 

30 fully, so that the cells become incorporated into the bird as the embryo is allowed to 
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develop. In one embodiment, sperm from a GO bird positive for the transgene is used 
to inseminate a chicken giving rise to a fully transgenic Gl generation. 

One approach may be to transfer a transfected nucleus to an enucleated 
recipient cell which may then develop into a zygote and ultimately an adult animal. 
5 The resulting animal is then grown to maturity. 

In the transgenic vertebrate of the present invention, the expression of the 
transgene may be restricted to specific subsets of cells, tissues or developmental stages 
utilizing, for example, trans-acting factors acting on the transcriptional regulatory 
region operably linked to the polypeptide-encoding region of interest of the present 

10 invention and which control gene expression in the desired pattern. Tissue-specific 
regulatory sequences and conditional regulatory sequences can be used to control 
expression of the transgene in certain spatial patterns. Moreover, temporal patterns of 
expression can be provided by, for example, conditional recombination systems or 
prokaryotic transcriptional regulatory sequences. By inserting an integration site such 

15 as attP into the genome, it is believed that expression of an integrated coding sequence 
wUl be much more predictable. 

The invention can be used to express, in large yields and at low cost, a wide 
range of desired proteins including those used as human and animal pharmaceuticals, 
diagnostics, and livestock feed additives. Proteins such as growth hormones, 

20 cytokines, structural proteins and enzymes including human growth hormone, 
interferon, lysozyme, and P-casein may be produced by the present methods. In one 
embodiment, proteins arc expressed in the oviduct and deposited in eggs of avians, 
such as chickens, according to the invention. The present invention includes these 
eggs and these proteins. 

25 The present invention also includes methods of producing transgenic vertebrate 

animals, for example, transgenic chickens, which employ the use of integrase, cationic 
polymers and/ nuclear localization signals. The present invention also includes the 
transgenic vertebrate animals, such as the avians, produced by these methods and other 
methods disclosed herein. The invention also includes the eggs produced by the 

30 transgenic avians produced by these methods and other methods disclosed herein. 
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In one embodiment, the methods of the invention include introducing into a 
cell: 1) a nucleic acid comprising a transgene; 2) an integrase activity; and 3) a cationic 
polymer. Such methods provide for an increased efficiency of transgenic avian 
production relative to identical methods without the cationic polymer. 
5 In another embodiment, the methods include introducing into a cell: 1) a 

nucleic acid comprising a transgene; 2) an integrase activity; and 3) a nuclear 
localization signal. Such methods provide for an increased efficiency of trainsgenic 
animal, for example, avian, production relative to identical methods without the 
nuclear localization signal. 

10 In another embodiment, the methods include introducing into a cell: 1) a 

nucleic acid comprising a transgene; 2) an integrase activity; 3) a cationic polymer; 
and 4) a nuclear localization signal. Such methods provide for an increased efficiency 
of transgenic vertebrate animal production relative to identical methods without the 
cationic polymer or without the nuclear localization signal. 

15 In one embodiment, the cell is a cell of an embryo, for example, an avian 

embryo. In one embodiment, the cell is a cell of an early stage avian embryo 
comprising a germinal disc. The avian cell may be, for example, a cell of a stage I 
avian embryo, a cell of a stage II avian embryo, a cell of a stage in avian embryo, a 
cell of a stage IV avian embryo, a cell of a stage V avian embryo, a cell of a stage VI 

20 avian embryo, a cell of a stage VII avian embryo, a cell of a stage VIII avian embryo, a 
cell of a stage IX avian embryo, a cell of a stage X avian embryo, a cell of a stage XI 
avian embryo or a cell of a stage XII avian embryo. In one particularly useful 
embodiment, the avian cell is a cell of a stage X avian embryo. In another useful 
embodiment, the avian cell is a cell of a stage I avian embryo. 

25 The methods provide for the introduction of nucleic acid into the avian cell by 

any suitable technique known to those of skill in the art. For example, the nucleic acid 
may be introduced into the avian cell by microinjecting, transfection, electroporation 
or lipofection. La one particularly useful embodiment, the introduction of the nucleic 
acid is accompUshed by microinjecting. 
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The nucleic acid which includes a transgene may be DNA or RNA or a 
combination of RNA and DNA. The nucleic acid may comprise a single strand or may 
comprise a double strand. The nucleic acid may be a linear nucleic acid or may be an 
open or closed circular nucleic acid and may be naturally occurring or synthetic. 
5 Integrase activity may be introduced into the cell, such as an avian cell, in any 

suitable form. In one embodiment, an integrase protein is introduced into the cell. In 
another embodiment, a nucleic acid encoding an integrase is introduced into the cell. 
ITie nucleic acid encoding the integrase may be double stranded DNA^ single stranded 
DNA, double stranded RNA, single stranded RNA or a single or double stranded 

10 nucleic acid which includes both RNA and DNA. In one particularly useful 
embodiment, the nucleic acid is mRNA. Integrase activity may be introduced into the 
cell by any suitable technique. Suitable techniques include those described herein for 
introducing the nucleic acid encoding a transgene into a cell. In one useful 
embodiment, the integrase activity is introduced into the cell with the nucleic acid 

15 encoding the transgene. For example, the integrase activity may be introduced into the 
cell in a mixture with the nucleic acid encoding the transgene. 

In one embodiment, a nuclear localization signal (NLvS) is associated with the 
nucleic acid which includes a transgene. For example, the NLS may be associated 
with the nucleic acid by a chemical bond. Examples of chemical bonds by which an 

20 NLS may be associated with the nucleic acid include an ionic bond, a covalent bond, 
hydrogen bond and Van der Waal's force. In one particularly useful embodiment, the 
nucleic acid which includes a transgene is associated with an NLS by an ionic bond. 
NLS may be introduced into the cell by any suitable technique. Suitable techniques 
included those described herein for introducing the nucleic acid encoding a transgene 

25 into a cell. In one useful embodiment, the NLS is introduced into the cell with the 
nucleic acid encoding the transgene. For example, the NLS may be intioduced into the 
cell while associated with the nucleic acid encoding the transgene. 

Cationic polymers may be employed to facilitate the production of transgenic 
vertebrate animals such as avians. For example, the cationic polymers may be 

30 employed in combination with integrase and/or NLS. Any suitable cationic polymer 
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may be used. For example, and without limitation, one or more of polyethylenimine, 
polylysine, DEAR-dextran, starburst dendrimers and starburst polyamidoamine 
dendrimers may be used. In a particularly useful embodiment, the cationic polymer 
includes polyethylenimine. The cationic polymer may be introduced into the cell by 
5 any suitable technique. Suitable techniques included those described herein for 
introducing the nucleic acid encoding a transgene into a cell. in one useful 
embodiment, the cationic polymer is introduced into the cell in a mixture with the 
nucleic acid encoding the transgene. For example, the cationic polymer may be 
introduced into the avian ceU while associated with the nucleic acid encoding the 
1 0 transgene. 

In one particularly useful embodiment of the invention, the transgene includes 
a coding sequence which is expressed in a ceU of the transgenic vertebrate animal, for 
example, a transgenic avian, producing a peptide or a polypeptide (e.g., a protein). 
The coding sequence may be expressed in any or all of the cells of the transgenic 

15 animal. for example, the coding sequence may be exi:iressed in the blood, the 
magnum and/or the sperm of the animal. In a particularly useful embodiment of the 
invention, the polypeptide is present in an egg, for example, in the egg white, produce 
by a transgenic avian. 

'ITie present invention also includes methods of dispersing nucleic acid in a 

20 cell, for example, in an avian cell (e.g., an avian embryo cell). For example, the 
nucleic acid may be dispersed in the cytoplasm of a cell. These methods include 
introducing into a cell a nucleic acid and a dispersing agent, for example, a cationic 
polymer (e.g., polyethylenimine. polylysine, DEAE-dextran. starburst dendrimers 
and/or starburst polyamidoainine dendrimers) in an amount that will disperse the 

25 nucleic acid in a cell. Typically, the dispersing of the nucleic acid is a homogeneous 
dispersing. In one embodiment, the dispersed nucleic acid includes a transgene. NLS 
or intcgrasc activity may also be introduced into the cell. Dispersing of the nucleic 
acid may be particularly useful when the DNA is introduced into a ceU containing a 
relatively large volume of cytoplasm, such as an avian embryo cell or a germinal disc. 

30 Dispersing of the nucleic acid in the cell can increase the likelihood that the nucleic 
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acid will contact and enter the nucleus of the ccU into which the nucleic acid has been 
introduced. Without such dispersing, the nucleic acid may localize to one or more 
areas within the ceU and may not contact the nucleus of the cell. Li addition, where the 
quantity of nucleic acid introduced into the cell is known, dispersing of the nucleic 
5 acid can assist in exposing the nucleus in the cell to known or specific concentrations 
of the nucleic acid. 

The methods of the invention include introducing the cell into a recipient 
animal, for example, an avian such as a chicken, wherein the recipient avian produces 
an offspring which includes the transgene. The cell may be introduced into a recipient 

10 animal by any suitable technique. 

The present invention also includes the identification of certain regions in the 
genome which are advantageous for heterologous gene expression. These regions can 
be identified by analysis, using methods known in the art, of the transgenic vertebrate 
animals or cells produced as disclosed herein. 

15 The production of vertehralc animals or avians which are the mature animals 

developed from the recombinant embryos, ovum and/or sperm of the invention 
typically are referred to as the GO generation and are usually hemizygous for each 
inserted transgene. The GO generation may be bred to non-transgenic animals to give 
rise to Gl transgenic offspring which are also hemizygous for the transgene. 'ITie Gl 

20 hemizygous offspring may be bred to non-transgenic animals giving rise to G2 
hemizygous offspring or may be bred together to give rise to G2 offspring 
homozygous for the transgene. In one embodiment, hemizygotic G2 offspring from 
the same line can be bred to produce G3 offspring homozygous for the transgene. In 
one embodiment, hemizygous GO animals are bred together to give rise to 

25 homozygous Gl offspring. These are merely examples of certain useful breeding 
schemes. The present invention contemplates the employment of any useful breeding 
scheme such as those known to individuals of ordinary skill in the art. 

In one embodiment, the production of transchromosomic avians which are 
mature avians developed from the recombinant embryos, ovum and/or sperm of the 

30 invention typically are referred to as the GO generation and are usually chimeric for the 
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artificial chromosome. The GO generation may be bred to non-transgenic (non- 
transchromosomic) birds to give rise to Gl transchromosomic offspring which contain 
the artificial chromosome in their genome in all or most cells in the bird. The Gl 
offspring may in turn be bred to non- transchromosomic birds giving rise lo G2 
5 offspring with a single copy of the artificial chromosome in their genome. It is also 
contemplated that birds which contain the artificial chromosome in their genome in all 
or most cells (e.g., Gl and/or G2 birds) may be bred together to give rise to offspring 
containing two of the artificial chromosome in their genome. It is contemplated that 
this process can be repeated, for example, by crossing the offspring containing two 

10 copies of the artificial chromosome in their genome, thus producing birds containing 
multiple copies, for example, four copies of the artificial chromosome in their genome. 
It is contemplated that this process can be repeated or modified as would be 
understood by a practitioner of skiU in the art to obtain a bird wilh a desired number of 
artificial chromosomes contained in its genome. In one useful embodiment, artificial 

15 chromosomes of different types or which contain different nucleotide sequences are 
individually introduced into the genome of individual avians which are bred to 
produce an avian containing more than one type of artificial chromosome in its 
genome. 

In one aspect, transchromosomic avians of the invention have a genome which 
20 includes a transgene of greater than about 5,000 nucleotides in length. In another 
aspect, transchromosomic avians of the invention have a genome which includes a 
transgene of between about 5,000 and about 50,000,000 nucleotides in length. For 
example, the transgene may be between about 5,000 nucleotides in length and about 
5,000,000 nucleotides in length. In one embodiment, the transgene is between about 
25 5,000 nucleotides in length and about 1,000,000 nucleotides in length. For example, 
the transgene may be between about 5,000 nucleotides in length and about 500,000 
nucleotides in length. 

In one aspect, transchromosomic avians of the invention have a genome which 
includes a transgene greater than about 8,000 nucleotides in length. In another aspect, 
30 transchromosomic avians of the invention have a genome which includes a transgene 
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of between about 8,000 and about 50,000,000 nucleotides in length. For example, the 
transgcnc may be between about 8,000 nucleotides in length and about 5,000,000 
nucleotides in length. In one embodiment, the transgene is between about 8,000 
nucleotides in length and about 1,000,000 nucleotides in length. For example, the 
5 transgene may be between about 8,000 nucleotides in length and about 500,000 
nucleotides in length. 

In one particularly useful embodiment, the transchromosomic avians of the 
invention lay eggs which contain one or more heterologous proteins, for example, one 
or more proteins (e.g., certain pharmaceutical proteins) which are heterologous or 

10 exogenous to the egg. The eggs may contain any useful amount of heterologous 
protein. In one embodiment, the eggs contain the heterologous protein in an amount 
greater than about 0.01 JXg per hard-shell egg. For example, the eggs may contain the 
heterologous protein in an amount in a range of between about 0.01 jxg per hard-shell 
egg and about 2 grams per hard-shell egg. hi one embodiment, the eggs contain 

15 between about 0.1 \ig per hard-shell egg and about 1 gram per hard-shell egg. For 
example, the eggs may contain between about 1 Jig per hard-shell egg and about 1 
gram per hard-shell egg. In one embodiment, the eggs contain between about 1 [ig per 
hard-shell egg and about 1 gram per hard-shell egg. For example, the eggs may 
contain between about 10 Jig per hard-shell egg and about 1 gram per hard-shell egg 

20 (e.g., the eggs may contain between about 10 jxg per hard-shell egg and about 100 mg 
per hard-shell egg). 

In one useful embodiment, the heterologous protein is present in the egg 
white of the eggs. In another useful embodiment, the heterologous protein is present 
in the egg white and is substantially not present in the egg yolk of the eggs. 

25 In one embodiment, the heterologous protein is present in egg white in an 

amount greater than about 0.01 jlg per ml of the egg. In another embodiment, the 
heterologous protein is present in egg white in an amount in a range of between about 
0.01 \lg per ml of the egg white and about 0.2 gram per ml of the egg white. For 
example, the heterologous protein may be present in egg white in an amount in a range 

30 of between about 0.1 jlg per ml of the egg white and about 0.5 gram per ml of the egg 
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white. In one embodiment, the heterologous protein is present in egg white in an 
amount in a range of between about 1 |i.g per ml of the egg white and about 0.2 gram 
per ml of the egg white. For example, the heterologous protein may be present in egg 
white in an amount in a range of between about 1 Jig per ml of the egg white and about 
5 0.1 gram per ml of the egg white (e.g., the heterologous protein may be present in egg 
white in an amount in a range of between about 1 [ig per ml of the egg white and about 
10 mg per ml of the egg white). 

Certain publications considered to be useful in the present invention, the 
disclosures of which are incorporated in their entirety herein by reference, include: 

10 ladonato et al (1996) RARE-cleavage analysis of YACs, Methods Mol Biol 54: 75-85; 
Popov et al. (1999) A human immunoglobulin lambda locus is similarly well 
expressed in mice and humans, J Rxp Med 189(10): 161 1-20; Call et al. (2000) A cre- 
lox recombination system for the targeted integration of circular yeast artificial 
chromosomes into embryonic stem cells, Hum Mol Genet 9(12): 1745-51; Csonka cl 

15 al. (2000) Novel generation of human satellite DNA-based artificial chromosomes in 
mammalian cells, .lournal of ('ell vSciencc 1 13, 3207-3216; (lOgel et al. (1996) 
Mapping of replication initiation sites in the mouse ribosomal gene cluster, 
Chromosoma 104, 511-518; Peterson et al. (1998) LCR-dependent gene expression in 
beta-globin YAC transgenics: detailed structural studies validate functional analysis 

20 even in the presence of fragmented YACs, Hum Mol Genet 7(1 3): 2079-88; Marschall 
et al. (1999) Transfer of YACs up to 2.3 mb intact into human cells with 
polyethylenimine. Gene Ther 6(9): 1634-7; Basu, J., G. Stromberg et al. (2005) Rapid 
creation of BAC-based human artificial chromosome vectors by transposition with 
synthetic alpha-satellite arrays. Nucleic Acids Res 33(2): 587-96; Lindenbaum et al. 

25 (2004) A mammalian artificial chromosome engineering system (ACR vSystem) 
appUcable to biopharmaceutical protein production, transgenesis and gene-based cell 
therapy. Nucleic Acids Res 32(21): el72; Nicholson et al. (1999) Antibody repertoires 
of four- and five-feature translocus mice carrying human immunoglobulin heavy chain 
and kappa and lambda light chain yeast artificial chromosomes, J Immunol 163(12): 

30 6898-906; Huxley (1994) Genetic Engineering. J. K. Sedow, New York, NY, Plenum 
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Press, 16: 65-91; Harvey et al. (2002) Consistent Production of Transgenic Chickens 
using Replication Deficient Retroviral Vectors and High-throughput Screening 
Procedures, Poultry Science 81(2): 202-12; Tomizuka et al (1997) Functional 
expression and germline transmission of a human chromosome fragment in chimeric 
5 mice. Nature Genetics 16:133-143; and Williams et al (1993) Cloning and sequencing 
of human immunoglobulin V-lambda gene segments, Eur J Immunol 23:1456-1461. 

Any useful combination of features described herein is included within the 
scope of the present invention provided that the features included in any such 
combination are not mutually inconsistent as will be apparent from the context, this 

10 specification, and the knowledge of one of ordinary skiU in the art. For example, the 
term transgenic can encompass the term transchromosomal and methodologies useful 
for transgenic animals (e.g., avians) and cells disclosed herein may also be employed 
for transchromosomal avians and avian cells. 

Additional objects and aspects of the present invention will become more 

15 apparent upon review of the detailed description set forth below when taken in 
conjunction with the accompanying figures, which are briefly described as follows. 

Brief Description of the Figures 

Fig. 1 illustrates phage integrase-mediated integration. A plasmid vector 
20 bearing the transgene includes the attB recognition sequence for the phage integrase. 
The vector along with integrase-coding mRNA, a vector expressing the integrase, or 
the integrase protein itself, are dehvered into cells or embryos. The integrase 
recognizes DNA sequences in the avian genome similar to attP sites, termed pseudo- 
attP, and mediates recombination between the attB and pseudo-attP sites, resulting in 
25 the permanent integration of the transgene into the avian genome. 

Fig. 2 illustrates the persistent expression of luciferase from a nucleic acid 
molecule after phiC3 1 intcgrasc-mcdiatcd integration into chicken cells. 

Fig. 3 illustrates the results of a puromycin resistance assay to measvu"e phiC3 1 
integrase-mediated integration into chicken cells. 
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Fig. 4 illustrates phiC31 integrase-mediated integration into qusdl cells. 
Puromycin resistance vectors bearing attB sites were cotransfected with phiC31 
integrase, or a control vector, into QT6 cells, a quail fibrosarcoma cell line. One day 
after transfection, puromycin was added. Puromycin resistant colonies were counted 
5 12 days post-transfection. 

Figs. 5A and 5B illustrate that phiC31 integrase can facihtate multiple 
integrations per avian cell. A puromycin resistance vector bearing an attB site was 
cotransfected with an enhanced green fluorescent protein (EGFP) expression vector 
bearing an attB site, and a phiCBl integrase expression vector. After puromycin 
10 selection, many puromycin resistant colonies expressed EGFP in all of their cells. 
Figs. 5A and 5B are the same field of view with EGFP illuminated with ultraviolet 
light (Fig. 5A) and puromycin resistant colonies photographed in visible light (Fig. 
5B). In Fig. 5B, there arc 4 puromycin resistant colonics, two of which arc juxtaposed 
at the top. One of these colonies expressed HGFP. 
15 Fig. 6 shows maps of the small vectors used for integrase assays. 

Fig. 7 shows integrase promotes efficient integration of large transgenes in 
avian cells. 

Fig. 8 shows maps of large vectors used for integrase assays. 

Fig. 9a and b illustrates the nucleotide sequence of the integrase-expressing 
20 plasmid pCMV-3 lint (SEQ ID NO: 1). 

I'ig. 10a and b illustrates the nucleotide sequence of the plasmid pCMV-luc- 
attB (SEQ ID NO: 2). 

Fig. 11a and b illustrates the nucleotide sequence of the plasmid pCMV-luc- 
attP (SEQ ID NO: 3). 

25 Fig. 12a and b illustrates the nucleotide sequence of the plasmid pCMV-pur- 

attB (SEQ ID NO: 4). 

Fig. 13a and b illustrates the nucleotide sequence of the plasmid pCMV-pur- 
attP (SEQ ID NO: 5). 

Fig. 14a and b illustrates the nucleotide sequence of the plasmid pCMV-EGFP- 
30 attB (SEQ ID NO: 6). 
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Fig. 15a to f illustrates the nucleotide sequence of the plasmid pl2.0-lys- 
LSPlPNMM-CMV-pur-attB (SHQ ID NO: 7). 

Fig. 16a to f illustrates the nucleotide sequence of the plasmid pOMIFN-Ins- 
CMV-pur-attB (SEQ ID NO: 8). 
5 Fig. 17a and b illustrates the nucleotide sequence of the integrase-expressing 

plasmid pRSV-lnt (SEQ ID NO: 9). 

Fig. 18a and b illustrates the nucleotide sequence of the plasmid p('R-XL- 
TOPO-CMV-pur-attB (SEQ ID NO: 10). 

Fig. 19 illustrates the nucleotide sequence of the attP containing polynucleotide 
10 SEQ ID NO: 11. 

Fig. 20 illustrates in schematic from the integration of a heterologous att 
recombination site into an isolated chromosome. The attB sequence is linked to 
selectable marker such as a puromycin expression cassette and is flanked by sequences 
found in the target site of the chromosome to be modified. The DNA is transfected 
15 into cells containing the chromosome and stable Iransfectants are selected for by drug 
resistance. Site specific integration may be confirmed by several techniques including 
PGR. 

Fig. 2 1 illustrates the persistent expression of luciferase from a nucleic acid 
molecule after phiC31 integrase-mediated integration into chicken cells bearing a 
20 wild-type attP sequence. 

Fig. 22 iUustrales the distribution of plasmid DNA in a stage I embryo. 

Fig. 23 illustrates the distribution of plasmid DNA in a stage I embryo in the 
presence of low molecular weight polyethylenimine. 

Fig. 24 illustrates the distribution of plasmid DNA in a stage I embryo in the 
25 presence of low molecular weight polyethylenimine. 

Fig. 25 illustrates the integration of a gene of interest (i.e., transgene OMC24- 
IRES-EPO) into an artificial chromosome by integration (which takes place inside of a 
host cell) wherein cells containing the recombinant chromosome can be selected for 
based on hygromycin resistance. 
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Fig. 26 illustrates the insertion of a nucleotide sequence of interest (A) into an 
attP site contained in an ALV genome which has been integrated into a chicken 
chromosome (B). The nucleotide sequence can be introduced into a cell containing the 
ALV genome by any useful method such as microinjection or transduction. For 
5 example, the nucleotide sequence can be introduced into an avian egg or germinal disc 
at any useful stage of development. For example, the nucleotide sequence can be 
introduced into a stage X egg by transduction. In another example, the nucleotide 
sequence can be introduced into a stage I egg by microinjection. 

Fig. 27 shows human light-chain locus (27 A) and heavy-chain locus (27B) 

10 containing YACs. V regions are numbered according to their gene family and their 
position in the locus, following the system of Lefranc et al (1999) IMCT, the 
international ImMuunoGenTics database Nucleic Acids Res. 27:209, the disclosure of 
which is incorporated in its entirety herein by reference. The Ig Heavy YAC contains 
the complete D and J region loci, the intro enhancer (not marked) and the lg|l and Ig5 

15 C regions. The Igl.ambda YAC contains the seven paired AJ and C regions, four of 
which are functional, and the 3' enhancer. 

Definitions and Abbreviations 

For convenience, definitions of certain terms and certain abbreviations 
20 employed in the specification, examples and appended claims are collected here. 

Abbreviations used in the present specification include the following: aa, 

amino acid(s); bp, base pair(s); kb, kilobase(s): mb, megabase(s): art, bacterial 

recomhination attachment site; lU, infectious units; mg, milligram(s); jj,g, 

microgram(s); ml, milliliter (s). 
25 As used in this specification and the appended claims, the singular forms "a," 

"an" and "the" include plural references unless the content clearly dictates otherwise. 

Thus, for example, reference to "an antigen" includes a mixture of two or more such 

agents. 

The term "antibody" as used herein refers to polyclonal and monoclonal 
30 antibodies and fragments thereof, and immunologic binding equivalents thereof. 
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Antibodies may include, but arc not limited to polyclonal antibodies, monoclonal 
antibodies (mAbs), humanized or chimeric antibodies, single chzdn zintibodies. Fab 
fragments, F(ah')^ fragments, fragments produced by a Fab expression library, anti- 
idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. 
5 As used herein, an "artificial chromosome" is a nucleic acid molecule that can 

stably replicate and segregate alongside endogenous chromosomes in a cell. Artificial 
chromosomes have the capacity to act as gene delivery vehicles by accommodating 
and expressing foreign genes contained therein. A mammalian artificial chromosome 
(MAC) refers to chromosomes that have an active mammalian centromere(s). Plant 

10 artificial chromosomes, insect artificial chromosomes and avian artificial 
chromosomes refer to chromosomes that include plant, insect and avian centromeres, 
respectively. A human artificial chromosome (HAC,) refers to chromosomes that 
include human centromeres. For exemplary artificial chromosomes, see, for example, 
U.S. Fat. Nos. 6,025,155, issued I cbruary L5. 2000; 6,077,697, issued June 6, 2000; 

15 5,288,625, issued Februaiy 22, 1994; 5,712,134, issued Januaj-y 27, 1998; 5,695,967, 
issued December 9, 1997; 5,869,294, issued February 9, 1999; 5,891,691, issued April 
6, 1999 and 5,721,118, issued February 24, 1998 and pubUshed International PCX 
apphcation Nos., WO 97/40183, pubUshed October 30, 1997; WO 98/08964, 
published March 5, 1998, published US Patent Applications, vSerial Nos. 08/835,682, 

20 filed April 10, 1997; 10/151,078, filed May 16, 2002; 10/235,119, filed September 3, 
2002; 10/086,745, filed February 28, 2002, the disclosures of which are incorporated 
herein in their entireties by reference. The term "chromosome" may be used 
interchangeably with the term "artificial chromosome" as will be apparent based on 
the context of such use. 

25 Foreign genes that can be contained in artificial chromosome expression 

systems can include, but are not limited to, nucleic acid that encodes therapeutically 
effective substances, such as anti-cancer agents, enzymes, hormones and antibodies. 
Other examples of heterologous DNA include, but are not limited to, DNA that 
encodes traceable marker proteins (reporter genes), such as fluorescent proteins, such 

30 as green, blue or red fluorescent proteins (GFP, BFP and RPP, respectively), other 
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reporter genes, such as beta-galactosidase and proteins that confer drug resistance, 
such as a gene encoding hygiomycin-resistance. 

The term "avian" as used herein refers to any species, subspecies or race of 
organism of the taxonomic class ava, such as, but not limited to chicken, turkey, duck, 
5 goose, quail, pheasants, paiTots, finches, hawks, crows and ratites including ostrich, 
emu and cassowary. The term includes the various known strains of Gallus gallus, or 
chickens, (for example, White Leghorn, Brown Leghorn, Barred-Rock, Sussex, New 
Hampshire, Rhode Island, Australorp, Minorca, Amrox, California Gray), as well as 
strains of turkeys, pheasants, quails, duck, ostriches and other poultry commonly bred 

10 in commercial quantities. It also includes an individual avian organism in all stages of 
development, including embryonic and fetal stages. The term "avian" also may denote 
"pertaining to a bird", such as "an avian (bird) cell." 

The tenns "chimeric animal" or "mosaic animal" are used herein to refer to an 
animal in which a nucleotide sequence of interest is found in some but not all cells of 

15 the animal, or in which the recombinant nucleic acid is expressed, in some but not all 
cells of the animal. The term "tissue-specific chimeric animal" indicates that the 
recombinant gene is present and/or expressed in some tissues but not others. 

The term "coding region" as used herein refers to a continuous linear 
arrangement of nucleotides which may be translated into a polypeptide. A full length 

20 coding region is translated into a full length protein; that is, a complete protein as 
would be translated in its natural state absent any post-translational modifications. A 
full length coding region may also include any leader protein sequence or any other 
region of the protein that may be excised natin^ally from the translated protein. 

The term "cytokine" as used herein refers to any secreted polypeptide that 

25 affects a function of cells and modulates an interaction between cells in the immune, 
inflammatory or hematopoietic response. A cytokine includes, but is not limited to, 
monokines and lymphokines. Examples of cytokines include, but are not limited to, 
interferon oGb, Literleukin-1 (IL-I), Interleukin-6 (IL-6), Interleukin-8 (IL-8), Tumor 
Necrosis Factor-a(TNF-a) and Tumor Necrosis Factor a(TNF- CO- 

30 As used herein, "delivery," which is used interchangeably with "transfection," 
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refers to the process by which exogenous nucleic acid molecules are transferred into a 
cell such that they are located inside the cell. 

As used herein, "DNA" is meant to include all types and sizes of DNA 
molecules including cDNA, plasmids and DNA including modified nucleotides and 
5 nucleotide analogs. 

The term "expressed" or "expression" as used herein refers to the transcription 
from a gene to give an RNA nucleic acid molecule at least complementary in part to a 
region of one of the two nucleic acid strands of the gene. The term "expressed" or 
"expression" as used herein may also refer to the translation from an RNA molecule to 

10 give a protein, a polypeptide or a portion thereof. In one embodiment, for 
heterologous nucleic acid to be expressed in a host cell, it must initially be dehvercd 
into the cell and then, once in the cell, ultimately reside in the nucleus. 

The term "gene" or "genes" as used herein refers to nucleic acid sequences that 
encode genetic infomiation for the synthesis of a whole RNA, a whole protein, or any 

15 portion of such whole RNA or whole protein. Genes that arc not naturally part of a 
particular organism's genome are referred to as "foreign genes," "heterologous genes" 
or "exogenous genes" and genes that are naturally a part of a particular organism's 
genome are referred to as "endogenous genes". The term "gene product" refers to an 
RNA or protein that is encoded by the gene. "Endogenous gene products" are RNAs 

20 or proteins encoded by endogenous genes. "Heterologous gene products" are RNAs or 
proteins encoded by "foreign, heterologous or exogenous genes" and are, therefore, not 
naturally expressed in the cell. 

As used herein, the terms "heterologous" and "foreign" with reference to 
nucleic acids, such as DNA and RNA. arc used interchangeably and refer to nucleic 

25 acid that does not occur naturally as part of a chromosome, a genome or cell in which 
it is present or which is found in a location(s) and/or in amounts that differ from the 
location(s) and/or amounts in which it occurs in nature. It can be nucleic acid that is 
not endogenous to the genome, chromosome or cell and has been exogenously 
introduced into the genome, chromosome or cell. Examples of heterologous DNA 

30 include, but are not limited to, DNA that encodes a gene product or gene product(s) of 
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interest, for example, for production of an encoded protein. Examples of heterologous 
DNA include, but are not limited to, DNA that encodes traceable marker proteins, 
DNA that encodes therapeutically effective substances, such as anti-cancer agents, 
enzymes and hormones and as antibodies. The terms "heterologous" and "exogenous" 
5 in general refer to abiomolecule such as a nucleic acid or a protein that is not normally 
found in a certain ceU, tissue or other component contained in or produced by an 
organism. For example, a protein that is heterologous or exogenous to an egg is a 
protein that is not normally found in the egg. 

The term "immunoglobulin polypeptide" as used herein refers to a constituent 

10 polypeptide of an antibody or a polypeptide derived therefrom. An "immunological 
polypeptide" may be, but is not limited to, an immunological heavy or light chain and 
may include a variable region, a diversity region, joining region and a constant region 
or any combination, variant or truncated form thereof. The term "immunological 
polypeptides" further includes single-chain antibodies comprised of, but not limited to, 

15 an immunoglobulin heavy chain variable region, an immunoglobuUn light chain 
variable region and optionally a peptide linker. 

The terms "integrase" and "integrase activity" as used herein refer to a nucleic 
acid recombinase of the serine recombinase family of proteins. 

The term "internal ribosome entry sites (IRES)" as used herein refers to a 

20 region of a nucleic acid, most typically an RNA molecule, wherein eukaryotic 
initiation of protein synthesis occurs far downstream of the 5^ end of the RNA 
molecule. A 43 S pre-initiation complex comprising the elfZ protein bound to GTP 
and Met-tRNAi^et^ the 40S ribosomal subunit, and factors elf3 and 3IfIA may bind to 
an "IRES" before locating an AUG start codon. An "IRES" may be used to initiate 

25 translation of a second coding region downstream of a first coding region, wherein 
each coding region is expressed individually, but under the initial control of a single 
upstream promoter. An "IRES" may be located in a eukaryotic cellular mRNA. 

As used herein, the term "large nucleic acid molecules" or "large nucleic acids" 
refers to a nucleic acid molecule of at least about 0.05 mb in size, greater than 0.5 mb, 

30 including nucleic acid molecules at least about 0.6, 0.7, 0.8, 0.9, 1, 5, 10, 30, 50 and 
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100, 200, 300, 500 mb in size. Large nucleic acid molecules typically can be on the 
order of about 10 to about 450 or more mb, and can be of various sizes, such as, for 
example, from about 250 to about 400 mb, about 150 to about 200 mb, about 90 to 
about 120 mb, about 60 to about 100 mb and about 15 to 50 mb. A large nucleic acid 
5 molecule may be larger than about 8 kb (e.g., about 8 kb to about 1 mb) as will be 
apparent based on the context. 

Examples of large nucleic acid molecules include, but are not limited to, 
natural chromosomes and fragments thereof, especially mammalian chromosomes and 
fragments thereof which retain a centromere or retain a centromere and telomeres, 

1 0 artificial chromosome expression systems (ACEs which include a mouse centromere; 
also called satelhte DNA-based artificial chromosomes (SATACs); see U.S. Pat. Nos. 
6,025,155, issued February 15; and 6,077,697, issued June 20, 2000), mammalian 
artificial chromosomes (MACs), plant artificial chromosomes, insect artificial 
chromosomes, avian artificial chromosomes and minichromosomes (sec, e.g., U.vS. 

15 Pat. Nos. 5,712,134, issued .January 27, 1998; 5,891,691, issued April 6, 1999; and 
5,288,625, issued February 22, 1994). Useful large nucleic acid molecules can include 
a single copy of a desired nucleic acid fragment encoding a particular nucleotide 
sequence, such as a gene of interest (transgene of interest), or can carry multiple copies 
thereof or multiple genes or different heterologous sequences of nucleotides. For 

20 example, the chromosomes may carry 1 to about 100 or 1 to about 1000 or even more 
copies of a gene of interest. Large nucleic acid molecules can be associated with 
proteins, for example chromosomal proteins, that typically function to regulate gene 
expression and/or participate in determining overall structure. 

A "monoclonal antibody" is an antibody in a population of antibodies each of 

25 which have the same primary structure. 

"Native" as used herein means being naturally associated with or a substance 
that is produced by a component or organism of interest (in which case the substance 
would be native to the component or organism) or being in an original form. 

A "nucleic acid fragment of interest" or "nucleotide sequence of interest" may 

30 be a trait-producing sequence, by which it is meant a sequence conferring a non-native 
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trait upon the cell in which the protein encoded by the trait-producing sequence is 
expressed. The term "non-native" when used in the context of a trait-producing 
sequence means that the trait produced is different than one would find in an 
unmodified organism which can mean that the organism produces high amounts of a 
5 natural substance in comparison to an unmodified organism, or produces a non-natural 
substance. For example, the genome of a bird could be modified to produce proteins 
not normally produced in birds such as, for example, useful animal proteins (e.g., 
human proteins) such as hormones, cytokines and antibodies. 

A nucleic acid fragment of interest may additionally be a "marker nucleic acid" 

10 or expressed as a "marker polypeptide". Marker genes encode proteins that can be 
easily detected in transformed cells and are, therefore, useful in the study of those 
cells. Examples of suitable marker genes include P-galactosidase, green or yellow 
fluorescent proteins, enhanced green fluorescent protein, chloramphenicol acetyl 
transferase, luciferase, and the like. Such regions may also include those 5' noncoding 

15 sequences involved with initiation of transcription and translation, such as the 
enhancer, 'I'A'l'A box, capping sequence, CAA'l' sequence, and the like. 

As used herein, "nucleic acid" refers to a polynucleotide containing at least two 
covalently linked nucleotide or nucleotide analog subunits. A nucleic acid can be a 
deoxyribonucleic acid (DNA), a ribonucleic acid (RNA), or an analog of DNA or 

20 RNA. Nucleotide analogs arc commercially available and methods of preparing 
polynucleotides containing such nucleotide analogs are known (Lin et al. (1994) Nucl. 
Acids Res. 22:5220-5234; JelUnek et al. (1995) Biochemistry 34:11363-11372; 
Pagratis et al. (1997) Nature Biotechnol. 15:68-73). The nucleic acid can be single- 
stranded, double- stranded, or a mixture thereof. For purposes herein, unless specified 

25 otherwise, the nucleic acid is double-stranded, or if it is apparent from the context that 
the nucleic acid is not double stranded. Nucleic acids include any natural or synthetic 
linear and sequential array of nucleotides and nucleosides, for example cDNA, 
genomic DNA, mRNA, tRNA, oligonucleotides, oUgonucleosides and derivatives 
thereof. For ease of discussion, certain nucleic acids may be collectively referred to 

30 herein as "constructs," "plasmids," or "vectors." 
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Techniques useful for isolating and characterizing the nucleic acids and 
proteins of the present invention are well known to those of skill in the art and 
standard molecular biology and biochemical manuals may be consulted to select 
suitable protocols without undue experimentation. See, for example, Sambrook et al, 
5 1989, "Molecular Cloning: A Laboratory Manual", 2nd ed.. Cold .Spring Harbor, the 
content of which is herein incorporated by reference in its entirety. 

A "nucleoside" is conventionally understood by workers of skiU in fields 
related to the present invention as comprising a monosaccharide linked in glycosidic 
linkage to a purine orpyrimidine base. A "nucleotide" comprises a nucleoside with at 

1 0 least one phosphate group appended, typically at a 3 ' or a 5 ' position (for pentoses) of 
the saccharide, but may be at other positions of the saccharide. A nucleotide may be 
abbreviated herein as "nt." Nucleotide residues occupy sequential positions in an 
oligonucleotide or a polynucleotide. Accordingly a modification or derivative of a 
nucleotide may occur at any sequential position in an oligonucleotide or a 

15 polynucleotide. All modified or derivatized oligonucleotides and polynucleotides are 
encompassed within the invention and fall within the scope of the claims. 
Modifications or derivatives can occur in the phosphate group, the monosaccharide or 
the base. 

By way of nonlimiting examples, the following descriptions provide certain 
20 modified or derivatized nucleotides. The phosphate group may be modified to a 
thiophosphate or a phosphonate. The phosphate may also be derivatized to include an 
additional esterilTed group to form a Iricster. The monosaccharide may be modified by 
being, for example, a pentose or a hexose other than a ribose or a deoxyribose. The 
monosaccharide may also be modified by substituting hydryoxyl groups with hydro or 
25 amino groups, by esterifying additional hydroxyl groups. The base may be modified 
as weU. Several modified bases occur naturally in various nucleic acids and other 
modifications may mimic or resemble such naturally occurring modified bases. 
Nonlimiting examples of modified or derivatized bases include 5-fluorouracil, 5- 
bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
30 (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
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carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanme, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
5 beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil , 5-methoxyuracil, 2- 
methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4- 
thiouracil, 5-me1iiyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid 
(v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 

10 diaininopurine. Nucleotides may also be modified to harbor a label. Nucleotides may 
also bear a fluorescent label or a biotin label. 

The term "operably linked" refers to an arrangement of elements wherein the 
components so described are configured so as to perform their usual function. Control 
sequences operably linked to a coding sequence are capable of effecting the expression 

15 of the coding sequence. The control sequences need not be contiguous with the coding 
sequence, so long as they function to direct the expression thereof. For example, 
intervening untranslated yet transcribed sequences can be present between a promoter 
sequence and the coding sequence and the promoter sequence can stiU be considered 
"operably linked" to the coding sequence. 

20 "Therapeutic proteins" or "pharmaceutical proteins" include an amino acid 

sequence which in whole or in part makes up a drug. In one embodiment, a 
pharmaceutical composition or therapeutic composition includes one or more 
pharmaceutical proteins or therapeutic proteins. 

The terms "polynucleotide," "oligonucleotide," and "nucleic acid sequence" 

25 are used interchangeably herein and include, but are not limited to, coding sequences 
(polynucleotide(s) or nucleic acid sequence(s) which are transcribed and translated 
into polypeptide in vitro or in vivo when placed under the control of appropriate 
regulatory or control sequences); control sequences (e.g., translational start and stop 
codons, promoter sequences, ribosome binding sites, polyadenylation signals, 

30 transcription factor binding sites, transcription termination sequences, upstream and 
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downstream regulatory domains, enhancers, silencers, and the like); and regulatory 
sequences (DNA sequences to which a transcription factor(s) binds and alters the 

activity of a gene's promoter either positively (induction) or negatively (repression). 
No limitation as to length or to synthetic origin are suggested by the terms described 
5 above. 

As used herein the terms "peptide," "polypeptide" and "protein" refer to a 
polymer of amino acids in a serial array, linked through peptide bonds. A "peptide" 
typically is a polymer of at least two to about 30 amino acids linked in a serial array by 
peptide bonds. The term "polypeptide" includes proteins, protein fragments, protein 

10 analogues, oligopeptides and the like. The term "polypeptides" contemplates 
polypeptides as defined above that are encoded by nucleic acids, produced through 
recombinant technology (isolated from an appropriate source such as a bird), or 
synthesized. The term "polypeptides" further contemplates polypeptides as defined 
above that include chemically modified amino acids or amino acids covalently or 

1 5 noncovalently linked to labeling moieties. 

The terms "percent sequence identity" or "percent sequence similarity" as used 
herein refer to the degree of sequence identity between two nucleic acid sequences or 
two amino acid sequences as determined using the algorithm of Karlin & Attschul, 
Proc. Natl. Acad. Sci. 87: 2264-2268 (1990), modified as in Karlin & Attschul, Proc. 

20 Nad. Acad. Sci. 90: 5873-5877 (1993). Such an algorithm is incorporated into the 
NBLAST and XBLASl' programs of Attschul ct al, 1990, T. Mol. Biol. 215: 403-410. 
BLAST nucleotide searches are performed with the NBLAST program, score = 100, 
word length = 12, to obtain nucleotide sequences homologous to a nucleic acid 
molecule of the invention. BLAST protein searches are performed with the XBLAST 

25 program, score = 50, word length ~ 3, to obtain amino acid sequences homologous to a 
reference polypeptide. To obtain gapped alignments for comparison purposes, Gapped 
BLAST is utiUzed as described in Attschul et al, Nucl. Acids Res. 25: 3389-3402 
(1997). When utihzing BLAST and Gapped BLAST programs, the default parameters 
of the respective programs (e.g. XBLAST and NBLAST) are used. Other algorithms, 

30 programs and default settings may also be suitable such as, but not only, the GCG- 
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Sequence Analysis Package of the U.K. Human Genome Mapping Project Resource 
Centre that includes programs for nucleotide or amino acid sequence comparisons. 
Examples of useful algorithms are FASTA andBESTFIT. 

The term "polyclonal antibodies" as used herein refers to a population of 
5 antibodies each of which recognize the same antigen or each of which recognize an 
antigen of a substance which contains one or more antigens. 

The term "promoter" as used herein refers to the DNA sequence that 
determines the site of transcription initiation by an RNA polymerase. A "promoter- 
proximal element" is a regulatory sequence generally within about 200 base pairs of 

10 the transcription start site. 

The term "pseudo-recombination site" as used herein refers to a site at which 
an integrase can faciUtate recombination even though the site may not have a sequence 
identical to the sequence of its wild-type recombination site. For example, a phiC3 1 
integrase and vector carrying aphiC3 1 wild-lype recombination site can be placed into 

15 an avian cell. The wild-type recombination sequence aligns itself with a sequence in 
the avian cell genome and the integrase facilitates a recombination event. When the 
sequence from the genomic site in the avian cell, where the integration of the vector 
took place, is examined, the sequence at the genomic site typically has some identity 
to, but may not be identical with, the wild-type bacterial genome recombination site. 

20 The recombination site in the avian cell genome is considered to be a pseudo- 
recombination site (e.g., a pscudo-attP site) at least because the avian cell is 
heterologous to the normal phiC31 phage/bacterial cell system. The size of the 
pseudo-recombination site can be determined through the use of a variety of methods 
including, but not limited to, (i) sequence alignment comparisons, (ii) secondary 

25 structural comparisons, (iii) deletion or point mutation analysis to find the functional 
limits of the pseudo-recombination site, and (iv) combinations of the foregoing. 

The terms "recombinant cell" and "genetically transformed cell" refer to a cell 
comprising a combination of nucleic acid segments not found in a single cell with each 
other in nature. A new combination of nucleic acid segments can be introduced into 

30 an organism using a wide array of nucleic acid manipulation techniques available to 
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those skilled in the art. The recombinant cell may harbor a vector that is 
extragenomic, i.e. that does not covalently insert into the cellular genome, including a 
non-nuclear (e.g. mitochondrial) gcnomc(s). A recombinant cell may further harbor a 
vector or a portion thereof that is intragenomic, i.e. covalently incorporated within the 
5 genome of the recombinanl cell. 

The term "recombination site" as used herein refers to a polynucleotide stretch 
comprising a recombination site normally recognized and used by an integrase. For 
example. A, phage is a temperate bacteriophage that infects E. coh. The phage has one 
attachment site for recombination (attP) and the E. coU bacterial genome has an 
10 attachment site for recombination (attB). Both of these sites are recombination sites 
for X integrase. Recombination sites recognized by a particular integrase can be 
derived from a homologous system and associated with heterologous sequences, for 
example, the attP site can be placed in other systems to act as a substrate for the 
integrase. 

15 The terms "recombinant nucleic acid" and "recombinant DNA" as used herein 

refer to combinations of at least two nucleic acid sequences that are not naturally 
found in a eukaryotic or prokaryotic cell. The nucleic acid sequences may include, but 
are not limited to, nucleic acid vectors, gene expression regulatory elements, origins of 
replication, suitable gene sequences that when expressed confer antibiotic resistance, 

20 protein-encoding sequences and the like. The term "recombinant polypeptide" is 
meant to include a polypeptide produced by recombinant DNA techniques. A 
recombinant polypeptide may be distinct from a naturally occurring polypeptide either 
in its location, purity or structure. Generally, a recombinant polypeptide will be 
present in a cell in an amount different from that normally observed in nature. 

25 As used herein, the term "satellite DNA-based artificial chromosome 

(SAT AC)" (e.g., ACE) is a type of artificial chromosome. These artificial 
chromosomes are substantially aU neutral non-coding sequences (heterochromatin) 
except for foreign heterologous, typically gene-encoding nucleic acid, that is present 
within (see U.S. Pat. Nos. 6,025,155, issued February 15, 2000 and 6,077,697, issued 
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June 20, 2000 and International PCT application No. WO 97/40183, published 
October 30, 1997). 

The term "source of integrase activity" as used herein refers to a polypeptide or 
multimeric protein having serine recombinase (integrase) activity in an avian cell. The 
5 term may further refer lo a polynucleotide encoding the serine recombinase, such as an 
mRNA, an expression vector, a gene or isolated gene that may be expressed as the 
recombinase- specific polypeptide or protein. 

As used herein the term "therapeutic substance" refers to a component that 
comprises a substance which can provide for a therapeutic effect, for example, a 
10 therapeutic protein. 

"Transchromosomic avian" means an avian which contains an artificial 
chromosome in some or all of its cells. A transchromosomic avian can include the 
artificial chromosome in its germ cells. 

The term "transcription regulatory sequences" as used herein refers Lo 
15 nucleotide sequences that are associated with a gene nucleic acid sequence and which 
regulate the transcriptional expression of the gene. Exemplary transcription regulatory 
sequences include enhancer elements, hormone response elements, steroid response 
elements, negative regulatory elements, and the like. 

The term "transfection" as used herein refers to the process of inserting a 
20 nucleic acid into a host cell. Many techniques are well known to those skiUed in the 
art to facilitate transfection of a nucleic acid into an eukaryotic cell. These methods 
include, for instance, treating the cells with high concentrations of salt such as a 
calcium or magnesium salt, an electric field, detergent, or liposome mediated 
transfection, to render the host cell competent for the uptake of the nucleic acid 
25 molecules, and by such methods as micro-injection into a pro-nucleus, sperm- 
mediated and restriction-mediated integration. 

The term "transformed" as used herein refers to a heritable alteration in a cell 
resulting from the uptake of a heterologous DNA. 

As used herein, the term "transgene" means a nucleic acid sequence that is 
30 partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which 
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it is introduced, or, is homologous to an endogenous gene of the transgenic animal or 
cell into which it is introduced, but which is designed to be inserted, or is inserted, into 
the animal's genome in such a way as to alter the genome of the cell into which it is 
inserted (e.g., it is inserted at a location which differs from that of the natural gene or 
5 its insertion results in a knockout). 

As used herein, a "transgenic avian" is any avian, as defined herein, in which 
one or more of the cells of the avian contain heterologous nucleic acid intioduced by 
manipulation, such as by transgenic techniques. The nucleic acid may be introduced 
into a cell, directly or indirectly, by introduction into a precursor of the cell by way of 
10 deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. Genetic manipulation also includes classical cross-breeding, or in 
vitro fertihzation. The heterologous nucleic acid may be an artificial chromosome or 
may be integrated within a chromosome of the avian, or it may be extrachromosomally 
replicating DNA. 

15 I'he teiTn "trisoinic" as used herein rclcrs to a cell or animal, such as an avian 

cell or bird that has a 2n+l chromosomal complement, where n is the haploid number 
of chromosomes, for the animal species concemed. 

The terms "vector" or "nucleic acid vector" as used herein refer to a natural or 
synthetic single or double stranded plasmid or viral nucleic acid molecule (RNA or 

20 DNA) that can be transfected or transformed into cells and replicate independently of, 
or within, the host ceU genome. The term "expression vector" as used herein refers to 
a nucleic acid vector that comprises a transcription regulatory region operably linked 
to a site wherein is, or can be, inserted, a nucleotide sequence to be transcribed and, 
optionally, to be expressed, for instance, but not limited to, a sequence coding at least 

25 one polypeptide. 

Detailed Description 

The present invention provides for the production of polyclonal antibodies, for 
example, human polyclonal antibodies, in avians and isolated avian cells. Such avians 
30 or avian cells may produce any useful type of antibody including, but not limited to, 
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one or more of IgG, IgM, IgA, IgE, and IgD including each of the subtypes of these 
antibodies. For example, subtypes of IgG include IgGl, IgG2, IgG3 and IgG4. 

In one particularly useful embodiment, the invention provides for the 
production of polyclonal antibodies which are deposited in the eggs of avians, such as 
5 chickens. It has been shown that active deposition of chicken IgG into the egg is 
mediated by specific sequences on the Fc portion of the antibody (Morrison et al 
(2002) Mol Immunol. 8:619-25). The IgG Fc antibody portion has also been shown to 
mediate the deposition into an egg of either intravenously injected human IgG^ or 
human monoclonal antibody produced in vivo from a transplanted chicken B-cell line, 

10 with high efficiency (Mohammed et al (1998) Immunotechnology, 4(2): 1 15-25). 
Chicken IgY does not bind to protein A or G and therefore human IgG can be easily 
affinity purified from other proteins including chicken immunoglobulins using protein 
A and/or protein G based purification methodologies as is known in the art. In a 
particular aspect, the antibody is deposited in the yolk of the egg of the avian. 

15 The invention provides for the insertion of large DNA segments into the 

germline of avians or avian cells. In one particular aspect, the large DNA segments 
include regions encoding components necessary for the production of human 
polyclonal antibodies. In one embodiment, the DNA segments include one or more Ig 
loci. The Ig loci may include one or more of human IgX,, IgK, IgH, and portions 

20 thereof. The Ig loci may be modified to include additional components, such as 
additional variable or constant regions, or they may be in their native form. Certain Ig 
loci and other disclosure which may be useful in accordance with the present invention 
are disclosed in, for example, IIS patent application publication No. 2()()2/()132373, 
pubhshed September 19, 2002; US patent application publication No. 2002/0088016, 

25 published July 4, 2002; US patent application pubUcation No. 2004/0231012, 
pubhshed November 18, 2004; US Patent No. 6,348,349, issued February 19, 2002; 
US Patent No. 5,545,807, issued August 13, 1996; and Popov et al (1999) J. Exp. 
Med. 189: 1611-1619. The disclosures of each of these three published patent 
applications and two issued patents andjournal article are incorporated in their entirety 

30 herein by reference. In one useful embodiment, the Ig loci shown in Fig. 27 A and 27B 
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are used to produce polyclonal antibodies in accordance with the present invention. 
These loci are disclosed in Nicholson, et al (1999) J. Immunology 163(12):6898-6906. 

The DNA segments comprising regions encoding components necessary for the 
production of human polyclonal antibodies may be employed in the invention in any 
5 useful form. For example, the DNA may be linear or circular. Typically, the DNA 
segments are present in a cloning vehicle which will facilitate the germline 
transmission of the DNA encoding the polyclonal antibodies. In one embodiment, 
artificial chromosomes which include one or more transgenes comprising components 
necessary for the production of human polyclonal antibodies are contemplated for use 

10 to produce germline transgenic avians of the invention. Typically, in this embodiment, 
a germline chimeric avian is obtained from embryos or germline cells of avians, such 
as chickens, into which one or more artificial chromosomes comprising the polyclonal 
antibody transgenes have been introduced as disclosed herein. Subsequently, a 
transgenic or fully transgenic (e.g., transchromosomic) Gl bird can be obtained from 

1 5 the germline chimera. 

In one useful embodiment, one or more Tg loci are included in an artificial 
chromosome. The artificial chromosome is introduced into an avian genome as 
disclosed herein. 

In one embodiment two artificial chromosomes are used, one having an Ig 
20 heavy chain locus and the other having an Ig light chain locus. In one embodiment, 
the two artificial chromosomes are co-introduced into an avian embryo to produce a 
germline-transgenic or transchromosomic avian which contains both chromosomes in 
its genome. 

In another embodiment, one or more DNA segments comprising regions 
25 necessary for the production of human polyclonal antibodies (e.g., Ig loci) may be used 
to produce chimeric and germline transgenic avians by incorporation into the genome 
of an avian by employing integrase mediated transgenesis as disclosed herein. 

The invention also contemplates one or more transgenes comprising 
components necessary for the production of human polyclonal antibodies such as Ig 
30 loci being introduced into an immortalized avian ceU line, the cells of which may be 
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capable of secreting the polyclonal antibodies into growth medium. In one particular 
embodiment, immortalized cell lines are derived from tumor cells of an avian oviduct 
or tumor cells from other cells of an avian, for example, cell lines disclosed in US 
Patent No 10/926,707, filed August 25, 2004. 
5 The invention also provides for the production and isolation of cell lines 

capable of producing monoclonal antibodies. By using standard methodologies well 
known in the art such as those disclosed in Michael et al (1998) Proc Natl Acad Sci 
UvSA 95:1166-1171, the disclosure of which is incorporated in its entirety herein by 
reference, cells of transgenic avians which contain human Ig heavy chain and human 

10 light chain producing loci in their genomes can be used to produce ceU lines capable of 
producing human monoclonal antibodies. For example, the transgenic or 
transchromosomic chicken is immunized with an antigen and hybridomas are 
produced by fusing cells (e.g., spleen cells) of the transgenic bird to an immortalized 
cell line to produce hybridomas. Antibody produced by individual liybridoma clones 

15 is screened to identify antibody with binding specificity for the antigen. The exon 
DNA (e.g., cDNA) encoding the antibody is cloned into mammalian Ig expression 
vectors which are co-transfected into mammalian myeloma cells to produce antigen 
specific antibody. 

Further, it is contemplated that immunoglobulin genes and other useful 
20 products can be provided by the invention. For example, genes encoding monoclonal 
antibodies can be obtained from monoclonal antibody producing cell lines produced in 
accordance with the present invention. 

The present invention contemplates the production of artificial chromosomes 
containing large transgenes. In one specific embodiment, the invention provides for 
25 the production of artificial chromosomes containing yeast artificial chromosomes 
(YACs) which contain a large DNA insert such as an Ig locus. 

In one embodiment, the present invention provides for the production of 
artificial chromosomes which contain transgenes wherein the transgene is introduced 
into the artificial chromosome during the de novo construction of the artificial 
30 chromosome. In one particularly useful embodiment of the invention, production of 
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artificial chromosomes which contain large transgenes (e.g., one or more Ig locus) is 
provided for. Large transgenes as disclosed herein can refer to transgenes greater in 
size than, for example, about 8 kb or about 10 kb or about 20 kb (e.g., about 8 kb to 
about 100 mb in size or about 10 kb to about 100 mb in size). 
5 In one embodiment, the invention provides for the introduction of transgene 

DNA into a cell in which the ai-tificial chromosome is produced at the time of 
production or assembly of the artificial chromosome. For example, components useful 
for the production of an artificial chromosome and one or more transgenes are 
introduced into the ceU at about the same time leading to the production of an artificial 

10 chromosome containing the transgene or transgenes. In one embodiment, the 
transgene will include a cloning vector such as a BAG or YAC to which the transgene 
is linked when introduced into the cell. In another embodiment, the transgene is 
removed from the cloning vehicle before introduction into the cell. For example, a 
cloning vehicle containing the transgene can be digested witii a nuclease such as a 

15 restriction enzyme to release from the cloning vehicle the DNA sequence (transgene) 
to be included in the artificial chromosome during its assembly. In one embodiment, 
after restriction nuclease digestion the DNA sequence of interest is separated from the 
cloning vehicle (e.g., by electrophoresis) prior to introduction of the DNA sequence 
into the cell. Without wishing to limit the invention to any theory or mechanism of 

20 operation, it is believed that as the artificial chromosome is assembled in the cell the 
transgene(s) is incorporated into the artificial chromosome during the assembly. 

In one embodiment, for artificial chromosome assembly, cells may be 
cotransfected with the transgene DNA and ribosomal RNA encoding DNA (rDNA). 
hi one embodiment, the rDNA is included in a cloning vehicle such as a plasmid or a 

25 cosmid. in one useful aspect of the invention, the cell in which the artificial 
chromosome is produced provides for certain components which will make up the new 
artificial chromosome such as telomeric nucleotide sequences. The cells which 
contain the new transgene containing artificial chromosome are identified and isolated. 
In one embodiment, the transgene carries a selectable marker such as a drug resistant 

30 gene providing for the selection of ceUs containing the new artificial chromosome. 
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Spontaneous generation of artificial chromosomes may be accomplished by the 
introduction of heterologous DNA and a marker gene into a cell such as a fibroblast 
cell, for example, DF-I cells (US Patent No. 5,672,485, issued September 30, 1997) or 
chicken embryo fibroblast cells. However, the methods are not limited to use of a 
5 fibroblast cell and the invention contemplates the employment of any useful cell. For 
example, cell lines such as CHO cells, Hela cells and other animal cell lines, for 
example, mammalian cell lines, are contemplated for use as disclosed herein. In one 
embodiment, the present invention contemplates the introduction of a desired 
transgcnc into a cell in combination with a marker and heterologous DNA thereby 

10 providing for the spontaneous generation of artificial chromosomes containing the 
desired transgene. The desired transgene typically includes a pharmaceutical protein 
coding sequence, such as a coding sequence for a pharmaceutical protein disclosed 
herein, and/or a promoter which functions in Ihe avian oviduct or an active portion 
thereof In one useful embodiment, the desired transgene comprises one or more 

15 human Ig locus or a portion thereof. 

Any useful method for the spontaneous assembly or production of artificial 
chromosomes is contemplated for use in accordance v^^ith the present invention. That 
is, incorporation of a nucleotide sequence of interest such as a promoter (e.g., 
ovalbumin promoter, ovomucoid promoter, lysozyme promoter or other promoters 

20 which function in the avian oviduct) and/or a coding sequence for a pharmaceutical 
protein during assembly of the chromosome (e.g., spontaneous assembly) is 
contemplated. For example, spontaneous assembly of artificial chromosomes (e.g., 
dicentric chromosomes minichromosomes. satellite artificial chromosomes or 
megachromosomes) as disclosed in, for example, US Patent No. 6,743,967, issued 

25 June 1, 2004; US Patent No. 5,288,625, issued February 22, 1994; and WO97/40183, 
the disclosures of which are incorporated in their entirety herein by reference, is 
contemplated for use in conjunction with the present invention. 

A selectable marker may be included in one or more vectors which are used in 
artificial chromosome construction (e.g., transgene containing vectors and/or other 

30 vectors containing DNA useful in production of the artificial chromosomes, for 
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example, and without limitation, rDNA). In the case where multiple vectors are 
introduced into a cell to produce an artificial chromosome, some or all of the vectors 
may have a selectable marker. In such a case, the selectable markers may be different 
selectable markers. Examples of useful selectable markers include, without limitation, 

5 genes which provide for resistance to hygromycin, zeomycin, neomycin and 
blastomycin. In one embodiment, vectors, for example, linearized vectors, when 
present in a cell that is producing a chromosome of the invention, may incorporate 
efficiently into the new chromosome, thereby precluding the need for one or more 
markers. 

10 One advantage of introducing large DNA molecules into an artificial 

chromosome during its assembly is that large DNA molecules can be gel purified and 
directly transfected as a linear molecule into the ceU line in which the chromosome is 
being assembled. (lel purification is important for isolating DNA molecules such as 
YACs from the other components of the host cells including the native cellular 

15 chromosomal DNA. Large, linear YACs are routinely purified in intact form by gel 
purification methods. Large circular YACh (cYAC) are not able to migrate through 
agarose in pulsed field gel electrophoresis (PFGE) (i.e, the cYACs remain in the wells) 
and therefore cannot be gel purified. 

The present methods are contemplated for the production of artificial 

20 chromosomes which contain any useful transgene. In one embodiment, artificial 
chromosomes which contain immunoglobulin genes (e.g., coding sequences for 
immunoglobulins and/or certain native gene expression controlling regions for 
immunoglobulins), such as human immunoglobulin loci or loci portions, are produced. 
In one particularly useful embodiment, the Ig loci include coding sequences for the 

25 immunoglobulins and certain native gene expression controlling regions of 
immunoglobulins. The human Ig containing artificial chromosome may be introduced 
into an avian such as a chicken such that the chicken produces human antibodies in its 
serum and the antibodies localize to the egg. hi one useful embodiment, the antibodies 
are polyclonal in nature and are produced by immunization of the transgenic animal 

30 with an antigen. In the case of such transgenic avians, such as chickens, the invention 
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contemplates the polyclonal human antibodies being deposited in the yolk of laying 
hens through a native transport system that has been shown to transfer antibodies, 
including human antibodies, from the blood serum to the yolk of forming eggs. In one 
embodiment, the invention contemplates the deposition of an amount between about 
5 0 . 1 |J.g and about 1 gram of polyclonal antibody per egg. 
> Human Ig genes are encoded on separate loci. Human heavy chain (IgH) is 

bchcvcd to be encoded by a single locus that is ~ 1.5 mb in size. There arc believed to 
be two loci for the human light chain, Igl£ and IgX,, either of which may be used for 
production of functional antibodies. The Igi^ locus is beUeved to be -1.1 mb and the 

10 IgA- locus is beheved to be ~3 mb. The invention contemplates the production of 
transgenic avians that carry either the light or the heavy chain or both the light and the 
heavy chain in their genome. For example, the loci may be present on one or more 
artificial chromosomes introduced into an avian's cells or may be introduced into the 
avian's genome by integrase mediated recombination as disclosed herein. 

15 In one embodiment, two artificial chromosomes are produced, one containing 

the light chain and one containing the heavy chain. In one embodiment, each artificial 
chromosome may be used to produce a separate line of animal (e.g., two lines of 
chickens). The two lines are crossed and offspring are selected that carry heavy and 
light chain artificial chromosomes. In another embodiment, the two artificial 

20 chromosomes arc co-introduced into the avian, e.g., co-injected into a germinal disc. 

In another embodiment, an artificial chromosome may be created that carries 
both the heavy locus and light chain locus allowing generation of a single line of 
animals capable of producing antibodies. 

In one embodiment of the invention, it is contemplated that the Ig gene(s) 

25 includes one or more additional variable region genes and/or one or more constant 
region genes which are not normally present in the Ig gene(s). 

Ig genes are polymorphic, particularly in the variable coding regions. 
Therefore, Ig-artificial chromosomes can be produced that are capable of creating 
polyclonal antibodies that are specifically enhanced for a particular target antigen. For 

30 example, it is found that a human family is particularly resistant to the development of 
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cancer, for example, a certain type of cancer such as breast cancer. The resistance trait 
is traced to their heavy and light chain genes, suggesting that this combination of 
heavy and light chain alleles can produce a mixture of antibodies that are exceptionally 
able to target and destroy cancer cells such as breast cancer cells. The heavy and light 
5 chain genes can be cloned from DNA extracted from a family member and inserted 
into an artificial chromosome. Therefore, in one embodiment of the invention, a 
transgenic animal such as a chicken carrying an artificial chromosome will produce 
polyclonal antibodies such that when immunized with cancer cells, or antigens thereof, 
such as breast cancer cells, or antigens thereof, polyclonal antibodies will be produced 

10 that can be used to treat cancer patients, for example, breast cancer patients. 

The present invention provides for recombinant vertebrate cells (e.g., 
transgenic or transchromosomal avian cells) and transgenic vertebrate animals (e.g., 
transgenic or transchromosomal avians) and methods of making the cells and the 
animals. For example, the invention provides for methods of inserting nucleotide 

15 sequences into the genome of verlebrate animals or into ihe cells of vcrlcbrale animals 
in a site specific manner. Examples of vertebrates include, without limitation, birds, 
mammals, fish, reptiles and amphibians. Examples of mammals include sheep, goats 
and cows. In one certain embodiment of the invention, the vertebrate animals arc birds 
or avians. Examples of birds include, without limitation, chickens, turkeys, ducks, 

20 geese, quail, pheasants, parrots, finches, hawks, crows and ratites including ostriches, 
emu and cassowary. Methods disclosed herein for producing transgenic and 
transchromosomic avians are generally applicable for aU avians. For example, though 
the size of the hard shell egg laid by avians may vary substantially (e.g., humtningbird 
eggs compared to ostrich eggs), the size and structure of the germinal disc is 

25 substantially the same among avians. Therefore, since the present invention, in large 
part, relies on the injection of large DNA molecules (e.g.. artificial chromosomes) into 
a germinal disc, a practitioner in the art would expect that the invention will function 
universally among avians. 

In one embodiment, the present invention provides for methods of inserting 

30 nucleotide sequences into the genome of an animal using methods of transgenesis 
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based on site specific integration, for example, site specific integrase mediated- 
transgenesis. The present invention contemplates any useful method of integrase 
mediated transgenesis including but not limited to, transgenesis mediated by serine 
recombinases and tyrosine recombinases. Serine recombinases are well known in the 
5 art and include without limitation, EcoYBCK^ <J>C31, SCH10.38c, SCC88.14, 
SC8F4.15C, SCD12A.23, Bxbl, WwKg Sau CcrB, Bsu CisB, TP901-1, O370.1, O105, 
^FCl, Al 18, Cacl956, Cacl951, Sau CcrA, Spn, TnpX, TndX, SPBc2, SC3C8.24, 
SC2E1.37, SCD78.04C, R45 ORvl, Y4bA5 BJa^ SsoISC1904b, SsoISC1904ag Aam, 
MjaMJ1004, Pabg SsoISC1913, HpyIS6075 MceRv0921, MtuRv0921, MtuRv2979c, 

10 MtuRv2792c, MtuISY349 5 MtuRv3828c, SauSKl, Spy, EcoTn21, MI0925 EcoTnB, 
LlagCpe, SauSK41, BmeTn5083, vSfaTn917, Bme53gRan, RmzY4Cri 5 SarpNLl, Pje, 
Xan, ISXc5, Pae, Xca, Req, Mlo90, PpsTn5501, pMEROS, Cgig MuGin, StyHin, 
Xfa91 I5 Xfa910, Rrh, SauTn552 and Aac serine recombinases. Tyrosine 
recombinases well known in the art include without limitation, BS codV, BS ripX, BS 

15 ydcF, CB tnpA, CollD ,CP4, Crc, 029, DLP12, DN int, EC FimB, EC EimE, EC orf, 
EC xerC, EC xerD, <J>11, <3>135<I>80, <I>adh, <t>CTX, <1>LC3, FLP, <I>R73, Hlorf, HI rci, 
HI xerC, HI xerD, HK22, HPl, L2, L5, L54, X, LL orf, LL xerC, LO L5, MJ orf, ML 
orf, MP int, MT int, MT orf, MV4, PI 86, P2, P21, P22, P4, P434, PA sss, PM fmiB, 
pAEl, pCLl, pKDl, pMEA, pSAM2, pSB2, pSB3, pSDL2, pSElOl, pSE211, pSMl, 

20 pSRl, pWS58, R721, Rci, SF6, vSLPl, SM orf, SsrA, SSVl, T12, Tn21, Tn4430, 
Tn554a, Tn554b, Tn7, Tn916, Tuc, WZ int, XisA and XisC. Other enzymes which 
may be useful for mediation of transgenesis in accordance with the present invention 
include, certain transposases, invertases and resolvases. 

In certain instances, integration host factors (IHF) may be necessary for the 

25 integration of nucleotide sequences of the invention into the genome of cells as 
disclosed herein. In such a case, the integration host factors may be delivered to the 
cells directly or they may be delivered to the cells in the form of a nucleic acid which, 
in the case of RNA, is translated to produce the IHF or, in the case of DNA, is 
transcribed and translated to produce the IHF. 
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The present invention contemplates the use of any system capable of site 
specifically inserting a nucleotide sequence of interest into the genome of a cell, for 
example, to produce a transgenic vertebrate animal. Typically, although not 
exclusively, these systems require at least three components: 1) a sequence in the 
5 genome which specifics the site of insertion; 2) a nucleotide sequence which is 
directed to the site of insertion and an enzyme which catalyzes the insertion of the 
nucleotide sequence into the genome at the site of insertion. Many enzymes, including 
integrases, which are capable of site specifically inserting nucleotide sequences into 
the genome have been characterized. Examples of these enzymes are disclosed in for 

10 example, Esposito et al (1997) Nucleic Acids Research, 25;36()5-3614 and Nunes- 
Duby et al (1998) Nucleic Acids Research, 26; 391-406. The disclosure of each of 
these references is incorporated herein in their entirety. 

In one embodiment of the present invention, a serine recombinase is employed. 
Serine recombinase integrase mediates recombination between an atlB site on a 

15 transgene vector and an attP or a pseudo attP site on a chromosome. In the method of 
the invention for integrase-mediated Iransgenesis, a heterologous wild-type atlP site 
can be integrated into a nuclear genome to create a transgenic ceU line or a transgenic 
vertebrate animal, such as an avian. A serine recombinase (integrase) and an attB- 
bearing transgene vector are then introduced into cells harboring the heterologous attP 

20 site, or into embryos derived from animals which bear the atlP recombination site. 
The locations of attP and attB may be reversed such that the attB site is inserted into a 
chromosome and the attP sequence resides in an incoming transgene vector. In either 
case, the art site of the introduced vector would then preferentially rccombinc with the 
integrated heterologous art site in the genome of the recipient cell. 

25 The methods of the invention are based, in part, on the discovery that there 

exists in vertebrate animal genomes, such as avian genomes, a number of specific 
nucleic acid sequences, termed pseudo-recombination sites, the sequences of which 
may be distinct from wild-type recombination sites but which can be recognized by a 
site-specific integrase and used to promote the efficient insertion of heterologous 

30 genes or polynucleotides into the targeted nuclear genome. The inventors have 
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identified pseudo-recombination sites in avian cells capable of recombining with a 
recombination site, such as an attB site within a recombinant nucleic acid molecule 
introduced into the target avian cell. The invention is also based on the prior 
integration of a heterologous att recombination site, typically isolated from a 
5 bacteriophage or a modification thereof, into the genome of the target avian cell. 

Integration into a predicted chromosomal site is useful to improve the 
predictability of expression, which is particularly advantageous when creating 
transgenic avians. Transgenesis by methods that result in insertion of the transgene 
into random positions of the avian genome is unpredictable since the transgene may 

10 not express at the expected levels or in the predicted tissues. 

The invention as disclosed herein, therefore, provides methods for site- 
specifically genetically transforming an avian nuclear genome. In general, an avian 
cell having a first recombination site in the nuclear genome is transformed with a site- 
specific polynucleotide construct comprising a second recombination sequence and 

15 one or more polynucleotides of interest. Into the same cell, intcgrase activity may be 
introduced that specifically recognizes the first and second recombination sites under 
conditions such that the polynucleotide sequence of interest is inserted into the nuclear 
genome via an integrase-mediated recombination event between the first and second 
recombination sites. 

20 The intcgrase activity, or a source thereof, can be introduced into the cell prior 

to, or concurrent with, the introduction of the site-specific construct. The intcgrase 
can be delivered to a cell as a polypeptide, or by expressing the intcgrase from a source 
polynucleotide such as an mRNA or from an expression vector that encodes the 
intcgrase, cither of which can be delivered to the tai-gct cell before, during or after 

25 delivery of the polynucleotide of interest. Any intcgrase that has activity in a cell may 
be useful in the present invention, including HK022 (Kolot et al, (2003) Biolechnol. 
Bioeng. 84: 56-60). In one embodiment, the intcgrase is a serine recombinase as 
described, for example, by Smith &, Thorpe, in Mol. Microbiol., 44: 299-307 (2002). 
For example, the intcgrase may be TP901-1 (vStoll et al, J. Bact, 184: 3657-3663 

30 (2002); Olivares et al. Gene, 278:167-176 (2001) or the intcgrase from the phage 
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phiC31. 

The nucleotide sequence of the junctions between an integrated transgene into 
the attP (or attB site) would be known. Thus, a PGR assay can be designed by one of 
skill in Ihe art lo detect when the integration event has occurred. The PGR assay for 
5 integration into a heterologous wild-type attB or attP site can also be readily 
incorporated into a quantitative PGR assay using TAQMAN"^ or related technology so 
that the efficiency of integration can be measured. 

In one embodiment, the minimal attB and attP sites able to catalyze 
recombination mediated by the phiG31 integrase are 34 and 39 bp, respectively. In 

1 0 cell lines that harbor a heterologous integrated attP site, however, integrase may have a 
preference for the inserted atti' over any pseudo-atU' sites of similar length, because 
pseudo-attP sites have very low sequence identity (for example, between 10 to 50% 
identity) compared to the more efficient wild-type attP sequence. It is within the scope 
of the methods of the invention, however, for the recombination site within the target 

15 genome to be a pseudo-att site such as a pseudo-attP site or an attP introduced into a 
genome. 

The sites used for recognition and recombination of phage and bacterial DNAs 
(the native host system) are generally non-identical, although they t5TJically have a 
common core region of nucleic acids. In one embodiment, the bacterial sequence is 

20 called the attB sequence (bacterial attachment) and the phage sequence is called the 
attP sequence (phage attachment). Because they are different sequences, 
recombination can result in a stretch of nucleic acids (for example, attL or attR for left 
and right) that is neither an attB sequence or an attP sequence, and likely is 
functionally unrecognizable as a recombination site to the relevant enzyme, thus 

25 removing the possibility that the enzyme will catalyze a second recombination reaction 
that would reverse the first. 

The integrase may recognize a recombination site where sequence of the 5' 
region of the recombination site can differ from the sequence of the 3' region of the 
recombination sequence. For example, for the phage phiG31 attP (the phage 

30 attachment site), the core region is 5'-TTG-3' the flanking sequences on either side are 
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represented here as attP5' and attP3', the structure of the attP recombination site is, 
accordingly, attP5'-TTG-attP3'. Correspondingly, for the native bacterial genomic 
target site (attB) the core region is 5'-TTG-3', and the flanking sequences on either side 
are represented here as attB5' and attB3'gthe structure of the attB recombination site is, 
5 accordingly, attB5'-TTG-attB3'. After a single-site, phiC31 integrase-mediated 
recombination event takes place between the phiC3 1 phage and the bacterial genome, 
the result is the following recombination product: attB5'-TTG-attP3'{phiC31 vector 
sequences }-attP5'-TTG-attB3'. In the method of invention, the attB site will be within 
a recombinant nucleic acid molecule that may be deUvered to a target cell. The 

10 corresponding attP (or pseudo-attP) site will be within the cell nuclear genome. 
Consequently, after phiC31 integrase mediated recombination, the recombination 
product, the nuclear genome with the integrated heterologous polynucleotide will have 
the sequence attP5'-TT(T-attB3'{heterologous polynucleotide} -attB5'-TTG-attP3'. 
Typically, after recombination the post-recombination recombination sites are no 

15 longer able to act as substrate for the phiC31 integrase. This results in stable 
integration with little or no integrase mediated excision. 

While the one useful recombination site to be included in the recombinant 
nucleic acid molecules and modified chromosomes of the present invention is the attP 
site, it is contemplated that any attP-like site may be used if compatible with the attB 

20 site. For instance, any pseudo-attP site of the chicken genome may be identified 
according to the methods of Example 7 herein and used as a heterologous att 
recombination site. For example, such attP-like sites may have a sequence that is 
greater than at least 25% identical to SEQ ID NO: 1 1 as shown in Fig. 19, such as 
described in Groth et al, Proc. Natl. Acad. Sci. U.S.A. 97: 5995-6000 (2000) 

25 incorporated herein by reference in its entirety. In one embodiment, the selected site 
will have a similar degree of efficiency of recombination, for example, at least the 
same degree of efficiency of recombination as the attP site (SEQ ID NO: 1 1) itself. 

In the present invention, the recipient cell population may be an isolated cell 
line such as, for example, DF-I chicken fibroblasts, chicken DT40 cells or a cell 

30 population derived from an early stage embryo, such as a chicken stage I embryo or 
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mid stage or late stage (e.g., stage X) embryos. One useful avian cell population is 
blastodcrmal cells isolated from a stage X avian embryo. The methods of the present 
invention, therefore, include steps for the isolation of blastodcrmal cells that are then 
suspended in a cell culture medium or buffer for maintaining the cells in a viable state, 
5 and which allows the cell suspension to contact the nucleic acids of the present 
invention. It is also within the scope of the invention for the nucleic acid construct and 
the source of integrasc activity to be delivered directly to an avian embryo such as a 
blastodcrmal layer, or to a tissue layer of an adult bird such as the lining of an oviduct. 
When the recipient cell population is isolated from an early stage avian 

10 embryo, the embryos must first be isolated. For stage I avian embryos from, for 
example, a chicken, a fertiUzed ovum is surgically removed from a bird before the 
deposition of the outer hard shell has occurred. The nucleic acids for integrating a 
heterologous nucleic acid into a recipient cell genome may then be delivered to 
isolated embryos by lipofection, microinjeclion (as described in Example 6 below) or 

15 electroporation and the like. After delivery of the nucleic acid, the transfected embryo 
and its yolk may be deposited into ihc infundibulum of a i-ecipient hen for the 
deposition of egg white proteins and a hard shell, and laying of the egg. Stage X avian 
embryos are obtained from freshly laid fertiUzed eggs and the blastodcrmal cells 
isolated as a suspension of ceUs in a medium, as described in Example 4 below. 

20 Isolated stage X blastodcrmal ceU populations, once transfected, may be injected into 
recipient stage X embryos and the hard shell eggs resealed according to the methods 
described in U.S. Patent No. 6,397,777, issued June 4, 2002, the disclosure of which is 
incorporated in its entirety by reference herein. 

In one embodiment of the invention, once a heterologous nucleic acid is 

25 delivered to the recipient cell, integrasc activity is expressed. The expressed integrase 
(or injected integrase polypeptide) then mediates recombination between the att site of 
the heterologous nucleic acid molecule, and the att (or pseudo att) site within the 
genomic DNA of the recipient avian cell. 

It is within the scope of the present invention for the intcgrasc-cncoding 

30 sequence and a promoter operably linked thereto to be included in the delivered 
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nucleic acid molecule and that expression of the integrase activity occurs before 
integration of the heterologous nucleic acid into the cell genome. In one embodiment, 
an intcgrasc-cncoding nucleic acid sequence and associated promoter arc in an 
expression vector that may be co-delivered to the recipient cell with the heterologous 
5 nucleic acid molecule to be integrated into the recipient genome. 

One suitable integrase expressing expression vector for use in the present 
invention is pCMV-C3 lint (SEQ ID NO: 1) as shown in Fig. 9, and described in Groth 
et al, Proc. Natl. Acad. Sci. U.S.A. 97: 5995-6()()() (2()()()), incorporated herein by 
reference in its entirety. In pCMV-C31int, expression of the inlegrase-encoding 
10 sequence is driven by the CMV promoter. However, any promoter may be used that 
will give expression of the integrase in a recipient cell, including operably linked 
avian-specifi'c gene expression control regions of the avian ovalbumin, lysozyme, 
ovomucin, ovomucoid gene loci, viral gene promoters, inducible promoters, the RSV 
promoter and the like. 

15 The recombinant nucleic acid molecules of the present invention for delivery 

of a heterologous polynucleotide to the genome of a recipient cell may comprise a 
nucleotide sequence encoding the attB attachment site of vStreptomyces ambofaciens as 
described in Thorpe & Smith, Proc. Natl. Acad. Sci. U.S.A. 95: 5505-5510 (1998). 
The nucleic acid molecule of the present invention may further comprise an expression 

20 cassette for the expression in a recipient ceU of a heterologous nucleic acid encoding a 
desired heterologous polypeptide. Optionally, the nucleic acid molecules may also 
comprise a marker such as, but not limited to, a puromycin resistance gene, a 
luciferase gene, EGFP, and the like. 

It is contemplated that the expression cassette, for introducing a desired 

25 heterologous polypeptide, comprises a promoter operably linked to a nucleic acid 
encoding the desired polypeptide and, optionally, a polyadenylation signal sequence. 
Exemplary nucleic acids suitable for use in the present invention are more fully 
described in the examples below. 

In one embodiment of the present invention, following delivery of the nucleic 

30 acid molecule and a source of integrase activity into a cell population, for example, an 
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avian cell population, the cells are maintained under culture conditions suitable for the 
expression of the integrase and/or for the integrase to mediate recombination between 
the recombination site of the nucleic acid and recombination site in the genome of a 
recipient cell. When the recipient cell is cultured in vitro, such cells may be incubated 
5 at 37° Celsius. For example, chicken early stage blastodermal cells may be incubated 
at 37° Celsius. They may then be injected into an embryo within a hard shell, which is 
resealed for incubation until hatching. Alternatively, the transfected cells may be 
maintained in in vitro culture. 

Li one embodiment, the present invention provides methods for the site- 

10 specific insertion of a heterologous nucleic acid molecule into the nuclear genome of a 
cell by dehverlng to a target cell that has a recombination site in its nuclear genome, a 
source of integrase activity, a site-specific construct that has another recombination 
site and a polynucleotide of interest, and allowing the integrase activity to facilitate a 
recombination event between the two recombination sites, thereby integrating the 

15 polynucleotide of interest into the nuclear genome. 

(a) Expression vector nucleic acid molecules: A variety of recombinant nucleic acid 
expression vectors arc suitable for use in the practice of the present invention. The 
site-specific constructs described herein can be constructed utilizing methodologies 
well known in the art of molecular biology (see, for example, Ausubel or Maniatis) in 

20 view of the teachings of the specification. As described above, the constructs are 
assembled by inserting into a suitable vector backbone a recombination site such as an 
attP or an attB site, a polynucleotide of interest operably linked to a gene expression 
control region of interest and, optionally a sequence encoding a positive selection 
marker. Polynucleotides of interest can include, but are not limited to, expression 

25 cassettes encoding a polypeptide to be expressed in the transformed cell or in a 
transgenic vertebrate animal derived therefrom. The site-specific constructs are 
typically, though not exclusively, circular and may also contain selectable markers, an 
origin of replication, and other elements. 

Any of the vectors of the present invention may also optionally include a 

30 sequence encoding a signal peptide that directs secretion of the polypeptide expressed 
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by the vector from the transgenic cells, for instance, from tubular gland cells of the 
oviduct of an avian. In one embodiment, this aspect of the invention effectively 
broadens the spectrum of exogenous proteins that may be deposited in the whites of 
avian eggs using the methods of the invention. Where an exogenous polypeptide 
5 would not otherwise be secreted, the vector bearing the coding sequence can be 
modified to comprise, for instance, about 60 bp encoding a signal peptide. The DNA 
sequence encoding the signal peptide may be inserted in the vector such that the signal 
peptide is located at the N-terminus of the polypeptide encoded by the vector. 

The expression vectors of the present invention can comprise a transcriptional 

10 regulatory region, for example, an avian transcriptional regulatory region, for directing 
expression of either fusion or non-fusion proteins. With fusion vectors, a number of 
amino acids are usually added to the desired expressed target gene sequence such as, 
but not limited to, a polypeptide sequence for thioredoxin. A proteolj/tic cleavage site 
may further be introduced at a site between the target recombinant protein and the 

1 5 fusion sequence. Additionally, a region of amino acids such as a polymeric histidine 
region may he introduced to allow binding of the fusion protein to metallic ions such 
as nickel bonded to a solid support, for purification of the fusion protein. Once the 
fusion protein has been purified, the cleavage site allows the target recombinant 
protein to be separated from the fusion sequence. Enzymes suitable for use in cleaving 

20 the proteolytic cleavage site include, but are not limited to. Factor Xa and thrombin. 
Fusion expression vectors that may be useful in the present invention include pGex 
(Amrad Corp., Melbourne, Australia), pRIT5 (Pharmacia, Piscataway, NJ) and pMAL 
(New England Biolabs, Beverly, MA), that fuse glutathione S-transferase, protein A, 
or maltose E binding protein, respectively, to a desired target recombinant protein. 

25 Epitope tags are short peptide sequences that are recognized by epitope specific 

antibodies. A fusion protein comprising a recombinant protein and an epitope tag can 
be simply and easily purified using an antibody bound to a chromatography resin, for 
example. The presence of the epitope tag furthermore allows the recombinant protein 
to be detected in subsequent assays, such as Western blots, without having to produce 

30 an antibody specific for the recombinant protein itself. Examples of commonly used 
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epitope tags include V5, glutathione-S -transferase ((tST), hemaglutinin (llA), the 
peptide Phe-His-His-Thr-Thr, chitin binding domain, and the like. 

Exemplary gene expression control regions for use in cells such as avian cells 
(e.g., chicken cells) include, but are not limited to, avian specific promoters such as the 
5 chicken lysozyme, ovalbumin, or ovomucoid promoters, and the like. Particularly 
useful in avian systems are tissue-specific promoters such as avian oviduct promoters 
that allow for expression and delivery of a heterologous polypeptide to an egg white. 

Viral promoters serve the same function as bacterial or eukaryotic promoters 
and either provide a specific RNA polymerase in trans (bacteriophage T7) or recruit 
10 cellular factors and RNA polymerase (SV40, RSV, CMV). Viral promoters can be 
useful as they are generally particularly strong promoters. One useful promoter for 
employment in avian cells is the RSV promoter. 

Selection markers are valuable elements in expression vectors as they provide a 
means to select for growth of only those cells that contain a vector. Common 
15 selectable marker genes include those for resistance to antibiotics such as ampicillin, 
puromycin, tetracycline, kanamycin, bleomycin, streptomycin, hygromycin, neomycin, 
ZEOCINTM, and the like. 

Another element useful in an expression vector is an origin of replication. 
Replication origins are unique DNA segments that contain multiple short repeated 
20 sequences that are recognized by multimeric origin-binding proteins and that play a 
key role in assembling DNA replication enzymes at the origin site. Suitable origins of 
replication for use in expression vectors employed herein include E. coU oriC, colEl 
plasmid origin, and the like. 

A further useful element in an expression vector is a multiple cloning site or 
25 polylinker. Synthetic DNA encoding a series of restriction endonuclease recognition 
sites is inserted into a vector, for example, downstream of the promoter element. 
These sites are engineered for convenient cloning of DNA into the vector at a specific 
position. 

Elements such as the foregoing can be combined to produce expression vectors 
30 suitable for use in the methods of the invention. Those of skill in the art will be able 
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to select and combine the elements suitable for use in their particular system in view of 
the teachings of the present specification. 

Provided for is the stable introduction of a large DNA molecule into the cell of 
an avian. In one particularly useful embodiment, the large DNA molecule is a 
5 chromosome. I'hc chromosomes to be introduced into cells of an avian may be 
referred to herein as "artificial chromosomes"; however, the term "artificial 
chromosome" is not a limiting term and any useful large DNA molecule or 
chromosome may be employed in the present invention. 

The present invention provides modified chromosomes, which are either 

10 isolated chromosomes or artificial chromosomes, which function as useful vectors to 
shuttle transgenes or gene clusters into the genome. By delivering the modified or 
artificial chromosome to an isolated recipient cell, the target cell, and progeny thereof, 
become Irisomic or transchromosomic. Typically, an additional or triosomic 
chromosome will not affect the subsequent development of the recipient cell and/or an 

15 embryo, nor interfere with the reproductive capacity of an adult developed from such 
cells or embryos. The chromosome also should be stable within chicken cells. An 
effective method is also required to isolate a population of chromosomes for delivery 
into chicken embryos or early cells. 

Chickens that are trisomic for microchromosome 16 have been described 

20 (Miller et al, Proc. Natl. Acad. Sci. U.S.A. 93: 3958-3962 (1996); MuscareUa et al, J. 
Cell Biol. 101: 1749-1756 (1985). hi these cases, triploidy and trisomy occurred 
naturally, and illustrate that an extra copy of one or more of the chicken chromosomes 
is compatible with normal development and reproductive capacity. 

The transchromosomic avians resulting from the cellular introduction of an 

25 artificial chromosome typically wiU comprise cells which include the normal 
complement of chromosomes plus at least one additional chromosome. In one 
embodiment, about 0.001% to 100% of the cells of the avian will include an additional 
chromosome. In another embodiment, about 0.1% to 100% of the cells of the avian 
wiU include an additional chromosome. In another embodiment, about 5% to 100% of 

30 the cells of the avian wiU include an additional chromosome. In another embodiment. 
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about 10% to 100% of the ceUs of the avian will include an additional chromosome. 
In another embodiment, about 50% to 100% of the cells of the avian will include an 
additional chromosome. In one particularly useful embodiment, the additional 
chromosome is transmitted through the germ-line of the transchromosomic avian and 
5 many, for example, most (i.e., more than 50%) of the cells of the offspring avians will 
include the additional chromosome. The invention contemplates the introduction and 
propagation of any useful number of chromosomes into the cell(s) of a transgenic 
avian or isolated avian cells. For example, the invention contemplates one artificial 
chromosome or two artificial chromosomes or three artificial chromosomes stably 
10 incorporated into the genome of the ccll(s) of a transchromosomal avian or isolated 
avian cells. 

Any or aU tissues of the transchromosomic avian can include the artificial 
chromosome. In one useful embodiment, one or more cells of the oviduct of the 
avians include the additional chromosome. For example, tubular gland cells of the 

15 oviduct may include the additional chromosome. 

A number of artificial chromosomes are useful in the methods of the invention, 
including, for instance, a human chromosome modified to work as an artificial 
chromosome in a heterologous species as described, for example, for mice (Tomizuka 
et al, Proc. Natl. Acad. Sci. U.S.A. 97: 722-727 (2000); for cattle (Kuroiwa et al, Nat. 

20 Biotechnol. 20: 889-894 (2002); a mammalian artificial chromosome used in mice (Co 
et al, Chromosome Res. 8: 183-191 (2000). 

Examples of large nucleic acid molecules include, but are not limited to, 
natural chromosomes and fragments thereof, for example, chromosomes (e.g., 
mammalian chromosomes) and fragments thereof which retain a centromere, artificial 

25 chromosome expression systems (saLellile DNA-based artificial chromosomes 
(SATACs); see U.S. Pat. Nos. 6,025,155, issued February 15, 2000 and 6,077,697 
issued June 20, 2000, the disclosures of which are incorporated herein in their entirety 
by reference), mammalian artificial chromosomes (MACs) (e.g., HACs), plant 
artificial chromosomes, insect artificial chromosomes, avian artificial chromosomes 

30 and minichromosomes (see, e.g., U.S. Pat. Nos. 5,712,134 issued January 27, 1998; 
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5,891,691, issued April 6, 1999; 5,288,625, issued February 22, 1994; 6,743,967 
issued June 1, 2004; and U.S. Patent Application Nos. 10/235,119, published June 19, 
2003, the disclosure of each of these six patents and the patent apphcation are 
incorporated herein in their entirety by reference). Also contemplated for use herein 
5 are YACs, BACs, bacteriophage-derived artificial chromosomes (BBPACs), cosmid or 
PI derived artificial chromosomes (PACs). 

As used herein, a large nucleic acid molecule such as artificial chromosomes 
can stably replicate and segregate alongside endogenous chromosomes in a cell. It has 
the capacity to act as a gene delivery vehicle by accommodating and expressing 

10 foreign genes contained therein. A mammalian artificial chromosome (MAC) refers to 
chromosomes that have an active mammalian centromere(s). Plant artificial 
chromosomes, insect artificial chromosomes and avian artificial chromosomes refer to 
chromosomes that include plant, insect and avian centromeres, respectively. A human 
artificial chromosome (HAC,) refers lo chromosomes that include human centromeres. 

15 For exemplary artificial chromosomes, see, e.g., U.S. Pat. Nos. 6,025,155, issued 
February 15, 2000; 6,077,697, issued June 20, 2000i 5,288,625, issued February 22, 
1994; 5,712,134, issued January 27, 1998; 5,695,967, issued December 9, 1997; 
5,869,294, issued February 9, 1999; 5,891,691, issued April 6, 1999 and 5,721,1 18, 
issued February 24, 1998 and published International PCT apphcation Nos., WO 

20 97/40183, pubhshed October 30, 1997 and WO 98/08964, published March 5, 1998, 
the disclosure of each of these eight patents and two PCT apphcations are incorporated 
in their entirety herein by reference. 

The large nucleic acid molecules (e.g., chromosomes) can include a single 
copy of a desired nucleic acid fragment encoding a particular nucleotide sequence, 

25 such as a gene of interest (e.g., transgene of interest), or can carry multiple copies 
thereof or multiple genes, different heterologous nucleotide sequences or expression 
cassettes or may encode one or more heterologous transcripts each encoding more than 
one useful protein product (for example, the transcript(s) may comprise an IRES). 
Any useful IRES may be employed in the invention. Sec, for example, US Patent No. 

30 4,937,190, issued January 26, 1990; Nature (1988) 334:320-325; J Virol (1988) 
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62:3068-3072; CcU (1992) 68:119-131; J Virol (1990) 64;4625-4631; and J Virol 
(1992) 66:1476-1483, the disclosures of which are incorporated in their entirety herein 
by reference, which disclose useful IRESs. For example, the nucleic acid molecules 
can carry 40 or even more copies of genes of interest. The large nucleic acid 
5 molecules can be associated with proteins, for example, chromosomal proteins, that 
typically function to regulate gene expression and/or participate in determining overall 
structure (e.g., nucleosomes). 

Certain useful artificial chromosomes, such as satellite DNA-based artificial 
chromosomes, can include substantially all neutral non-coding sequences 

10 (heterochromatin) except for foreign heterologous, typically gene-encoding, nucleic 
acid (see U.vS. Pat. Nos. 6,025,155, issued February 15, 2000 and 6,077,697, issued 
June 20, 2000 and International PCT application No. WO 97/40183, pubUshed 
October 30, 1997 and Lindenbaum et al Nucleic Acids Res (2004) vol 32 no. 2 1 el 72, 
the disclosures of these two patents, the PCT application and the publication are 

15 incorporated in their entirety herein by reference). Foreign genes (i.e., nucleotide 
sequences of interest) contained in these artificial chromosomes can include, but are 
not limited to, nucleic acid that encodes therapeutically effective substances (e.g., 
therapeutic proteins such as those disclosed elsewhere herein and traceable marker 
proteins (reporter genes), such as fluorescent proteins, such as green, blue or red 

20 fluorescent proteins (GFP, BFP and RFP, respectively), other reporter genes, such as 
beta-galactosidase and proteins that confer drug resistance, such as a gene encoding 
hygromycin-resi stance . 

Preferably, the artificial chromosomes employed herein do not interfere with 
the host cells' processes and can be easily purified by useful purification methods such 

25 as large-scale by high-speed flow cytometry. See, for example, de Jong, G, et al. 
Cytometry 35: 129-33, 1999, the disclosure of which is incorporated herein in its 
entirety by reference. In one embodiment, flow cytometry is employed to purify 
chromosomes according to de Jong supra, with the exception that the Hoechst 33258 
used to stain the chromosome suspension prior to flow cytometric sorting is diluted to 

30 a concentration of about 0.125 |Xg/ml opposed to 2.5 |J,g/ml. Such artificial 
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chromosomes are useful for the production of transchromosomic chickens produced by 
introduction of the chromosomes into certain cells, for example, the germline cells, of 
an avian. In one particularly useful embodiment of the present invention, the 
transchromosomic chickens are produced by microinjection of the chromosomes, for 
5 example, cytoplasmic injection of the chromosomes into avian embryos, for example, 
early stage embryos such as a Stage I embryos, see, for example, US Patent 
Apphcation No. 10/679,034, filed October 2, 2003, the disclosure of which is 
incorporated in its entirety herein by reference. 

In one embodiment, heterologous nucleic acid is introduced into an artificial 

10 chromosome. Any useful method to introduce the nucleic acid into the chromosome 
may be employed in the invention. Thereafter, the artificial chromosomes are isolated 
in a mixture substantially free of other chromosomes or cellular material. For 
example, artificial chromosomes may be isolated by flow cytometry (e.g., dual laser 
high-speed flow cytometer as described previously (de Jong, G, et al. Cytometry 35: 

15 129-33, 1999). Sec, for example, US Patent Apphcation Pubhcation No. 
20030113917, published June 19, 2003, the disclosure of which is incorporated in its 
entirety herein by reference. 

In accordance with the present invention, any useful number of artificial 
chromosomes may be introduced into an avian cell (e.g., injected), for example, an 

20 avian germinal cell such as a ccU of an ova, an embryo or a germinal disc of an avian 
egg. Any useful method of introducing the chromosomes into the avian cell is 
contemplated for use in the present invention. In addition, the invention contemplates 
the introduction of any useful number of chromosomes into an avian cell. For 
example, and without limitation, the invention contemplates the introduction of 1 to 

25 about 1,000,000 chromosomes injected per egg. In one embodiment, 1 to about 
100,000 chromosomes are injected per egg. In another embodiment about 5 to about 
100,000 artificial chromosomes are injected per egg. For example, about 10 to about 
50,000 chromosomes may be injected per egg. 

In one embodiment, there is a lower hatch rate for eggs injected with more than 

30 a certain number of chromosomes, hi one embodiment, an injection of over 100,000 
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chromosomes reduces or brings the hatch rale to zero. In another embodiment, an 
injection of over 20,000 chromosomes reduces or brings the hatch rate to zero. In 
another embodiment, an injection of over 5,000 chromosomes reduces or brings the 
hatch rate to zero. In another embodiment, an injection of over 2,()()() chromosomes 
5 reduces or brings the hatch rate to zero. For example, an injection of over 1,000 (e.g., 
550) chromosomes reduces or brings the hatch rate to zero. 

For injection, any useful volume of injection buffer may be used for each 
injection. For example, about 1 nl to about 1 jxl may be injected, hi addition, any 
useful concentration of chromosomes may be employed in the injection buffer. For 

10 example, and without limitation, 1 to about 100,000 chromosomes per microUter may 
be used. In addition, any useful number of injections may be performed on each egg. 

hi one embodiment, a concentration of 7000-11,500 chromosomes is used per 
|jl of injection buffer (Monteith, D, et al. Methods Mol Biol 240: 227-242, 2004). In 
one embodiment, 25-100 nanoliters (nl) of injection buffer is used per injection. 

15 Any useful avian embryos may be employed in the present invention. For 

example, the embryos may be collected from 24-36 week-old hens (e.g., commercial 
White Leghorn variety of G. gallus). hi one embodiment, a germinal disc is injected 
with the chromosomes. In one embodiment, the embryo donor hens are inseminated 
weekly using pooled semen from roosters to produce eggs for injection. Any useful 

20 method, such as methods known to those skilled in the art, may be employed to collect 
fertilized eggs. 

Cytoplasmic injection of artificial chromosomes can be achieved by employing 
certain microinjection systems or assemblies. In one particularly useful embodiment, 
the microinjection assembly or microinjection system disclosed in US Patent 
25 Application No. 09/919,143, filed July 31, 2001 (the '143 apphcation), the disclosure 
of which is incorporated herein in its entirety, is employed. Use of such a cytoplasmic 
injection device allows for the precise delivery of chromosomes into the cytoplasm of 
avian embryos, for example, early stage avian embryos, e.g.. Stage I embryos. 

Typically, following microinjection, the embryos are transferred to the oviduct 
30 of recipient hens utilizing any useful technique, such as that disclosed in Olsen, M and 
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Neher, B. (1948) J Exp Zool 109: 355-66 followed by incubation and hatching of the 
birds. 

Any useful method, such as PGR, may be used to test for the production of 
transchromosomic avians. T5rpically, the identification of a transchromosomic 
5 offspring is confirmed by fluorescence in-situ hybridization (FISH) and/or DNA 
analysis such as Southern blot or the like. In one useful embodiment, artificial 
chromosomes can be used as vectors to introduce large DNA pay loads, such as 
nucleotide sequences to be expressed heterologously in the avian to yield a desired 
biomolecule, of stably maintained genetic information into transgenic chickens. 

10 Production of germline transchromosomic avians is confirmed by the production of 
transchromosomic offspring from the G O birds. 

The present invention provides for the introduction of desired nucleotide 
sequences into a chromosome, the chromosome of which can subsequently be 
isolated/purified and thereafter introduced into an avian as disclosed herein. 

15 A useful chromosome isolation protocol can comprise the steps of inserting a 

lac-operator sequence (Robinclt et al J. Cell Biol. 135: 1685-1700 (1996) into an 
isolated chromosome and, optionally, inserting a desired transgene sequence within the 
same chromosome. In one embodiment, the lac operator region is a concatamer of a 
plurality of lac operators for the binding of multiple lac repressor molecules. Insertion 

20 can be accomplished, for instance, by identifying a region of known nucleotide 
sequence associated with a particular avian chromosome. A recombinant DNA 
molecule may be constructed that comprises the identified region, a recombination site 
such as attB or attP and a lac-operator concatamer. The recombinant molecule is 
delivered to an isolated avian cell, for example, but not limited to, chicken DT40 cells 

25 that have elevated homologous recombination activity compared to other avian cell 
lines, whereupon homologous recombination will integrate the heterologous 
recombination site and the lac-operator concatamer into the targeted chromosome as 
shown in the schema illustrated in Fig. 20. A tag-polypeptide comprising a label 
domain and a lac repressor domain is also delivered to the cell, for example, by 

30 expression from a suitable expression vector. The nucleotide sequence coding for a 
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GFP-lac-repressor fusion protein (Robinett et al, J. Cell Biol. 135: 1685-1700 (1996)) 
may be inserted into the same chromosome as the lac-operator insert. The lac 
repressor sequence, however, can also be within a different chromosome. An 
inducible promoter may also be used to allow the expression of the GFP-lac-repressor 
5 only after chromosome is to be isolated. 

Induced expression of the GPF-lac-repressor fusion protein will result in 
specific binding of the tag fusion polypeptide to the lac-operator sequence for 
identification and isolation of the genetically modified chromosome. The tagged 
mitotic chromosome can be isolated using, for instance, flow cytometry as described in 
10 de Jong et al Cytometry 35: 129-133 (1999) and Griffin et al Cytogenet. Cell Genet. 
87: 278-281 (1999). 

A tagged chromosome can also be isolated using microcell technology 
requiring treatment of ceils with the mitotic inhibitor colccmid to induce the formation 
of micronuclei containing intact isolated chromosomes within the cell. Final 

15 separation of the micronuclei is then accomplished by centrifugation in cytochalasin as 
described by Killary & Founder in Methods Enzymol. 254: 133-152 (1995). Further 
purification of microceUs containing only the desired tagged chromosome could be 
done by flow cytometry. It is contemplated, however, that alternative methods to 
isolate the mitotic chromosomes or microceUs, including mechanical isolation or the 

20 use of laser scissors and tweezers, and the like. 

The present invention envisions the employment of any useful protein-DNA 
binding or interaction to assist in isolating/purifying chromosomes of the invention. 
Such other methods in which a desired chromosome can be labeled for purposes of 
isolation/purification, are well known in the art including but not limited to, steroid 

25 receptor (such as the glucocorticoid receptor): site specific response element systems, 
see, for example, McNally ct al (2000) Science 287:1262-1265; the bacteriophage 
lambda repressor system; and human homeobox genes. In addition, certain mutant 
forms of proteins which are employed in these systems (e.g., mutant proteins which 
bind there substrate with greater affinity than the non-mutant form of the protein) can 

30 be particularly useful for chromosome tagging (i.e., association of the chromosome 
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with a marker that allows distinction of the chromosome, for example, distinction 
from cellular components such as other chromosomes) from other and subsequent 
isolation/purification of the chromosomes. Furthermore the invention contemplates 
the use of one or more selectable markers to identify cells which contain chromosomes 
5 comprising an introduced sequence of interest. 

One specific embodiment of the making of a recombinant artificial 
chromosome can be seen in FIG. 25. Tn this embodiment the artificial chromosome 
includes a promoter which expresses a marker. Shown in FIG. 25 is an SV40 
promoter, however, any useful promoter may be employed. For example, any 

10 promoter which will facilitate transcription in a ceU line in which the artificial 
chromosome is present may be employed. For example, and without limitation, the 
following promoters may be useful: Pol in promoters (including type 1, type 2 and 
type 3 Pol III promoters) such as HI promoters, U6 promoters, tRNA promoters, 
RNase MPR promoters and functional poitions of each of these promoters. Other 

15 promoters that may be useful include, without limitation, Pol 1 promoters, Pol II 
promoters, cytomegalovirus (CMV) promoters, rous-sarcoma virus (RSV) promoters, 
murine leukemia virus (MLV) promoters, mouse mammary tumor virus (MMTV) 
promoters, ovalbumin promoters, lysozyme promoters, conalbumin promoters, 
ovomucoid promoters, ovomucin promoters, ovotransfcrrin promoters and functional 

20 portions of each of these promoters. 

The schematic of FIG. 25 shows a vector which includes an OMC24-IRES- 
EPO nucleotide sequence of interest and a marker coding sequence both contained on 
a vector which integrates into the artificial chromosome. FIG. 25 shows ahygromycin 
resistance marker being employed. However, any useful marker (e.g., antibiotic 

25 resistant marker) may be used. For example, and without limitation, zeomycin 
resistance, neomycin resistance and blastomycin resistance markers can be used. Also 
shown is the use of an attP present on the artificial chromosome and an attB site 
present on the vector. However, any useful recombination sites and intcgrasc may be 
employed such as those disclosed elsewhere herein. 
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In one embodiment, a useful cell line such as LMTK- containing the 
chromosome (A) in FIG. 25 is transfected with the vector B by standard 
methodologies such as lipofcction. After introduction of the vector (B) into the 
artificial chromosome containing cell line, integration occurs, for example, between 
5 integration sites such as lambda attB and attP sites, wherein the hygromycin marker is 
expressed in the cells which contain the recombined artificial chromosome allowing 
for selection of the cells. For the employment of such integration sites, integrase or an 
integrase encoding gene is typically also introduced into the cell. In one useful 
embodiment, a lambda integrase gene is used which produces an integrase protein with 

10 a substitution mutation at the glutamine residue at position 174 to a lysine. This 
mutation removes the requirement for host factors allowing the integrase to function in 
cell lines. A practitioner of skill in the art will recognize that many variations to this 
basic recombination methodology may be employed. 

It is contemplated that more than one, for example, between 1 and 100 rounds 

15 of integration of a nucleotide sequence of interest into the artificial chromosome may 
be performed. For example, one, two, three, four or more rounds of integration may 
be performed. In certain useful instances of multiple insertions of nucleotide 
sequences of interest into the artificial chromosome, it can be advantageous to employ 
different selectable markers. Any useful selectable markers can be employed in the 

20 case of multiple insertions, for example, and without limitation, genes which provide 
for resistance to hygromycin, zeomycin, neomycin and blastomycin can be used. 

In one embodiment, multiple integration sites (e.g., multiple attP sites) are 
present in the artificial chromosome. Multiple rounds of integration can be performed 
to obtain insertions of more than one nucleotide sequence of interest in an artificial 

25 chromosome or to obtain an artificial chromosome with multiple copies of the same 
nucleotide sequence of interest. After each round of integration, a different marker can 
be used for each round of integration. For example, the nucleotide sequence to be 
inserted can include a hygromycin resistance coding sequence in the first round of 
integration, in the second round of integration the nucleotide to be inserted can include 

30 a zeomycin resistance marker coding sequence, in the third round of integration the 
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nucleotide sequence to be inserted can include a neomycin resistance marker coding 
sequence and , in the fourth round of integration the nucleotide sequence to be inserted 
can include a blastomycin resistance marker coding sequence. A round of integration 
is where a nucleotide sequence of interest is introduced into the cell containing the 
5 artificial chromosome in which inlegralion of the nucleotide sequence lakes place as 
disclosed herein (e.g., integration into a site having a promoter proximal to the 
integration site which is operable to express the marker). This is merely an example of 
a method for integration of multiple nucleotide sequences of interest into an artificial 
chromosome for use as disclosed herein. The employment of other useful 

10 methodologies for integration of multiple nucleotide sequences into an artificial 
chromosome, as are understood by a practitioner of skiU in the art, is included within 
the scope of the invention. 

In certain instances it can be useful to rescue an artificial chromosome from an 
avian or cultured avian cell which contains the artificial chromosome in its genome. 

1 5 For example, certain artificial chromosomes may fragment after being introduced into 
an avian. During fragmentation, the artificial chromosome can be reduced 
substantially in size, (e.g., reduced by about 50% or about 60% or about 70% or about 
80% or about 90% or more in size). However, after fragmentation the aitificial 
chromosome can stabilize and as such is no longer susceptible to significant 

20 fragmentation or degradation in avian cells. These stabilized artificial chromosomes 
can be particularly useful for efficiently producing transchromosomic avians because, 
since the chromosomes are stabilized, they do not further fragment upon introduction 
into an avian or avian cell. 

After recovery of the artificial chromosome from cells of the transchromsomic 

25 avian in which the artificial chromosome was originally introduced, the chromosome 
can be re-introduced into a cell line such as a CTK) cell line in which a nucleotide 
sequence of interest is introduced into an integration site present in the aitificial 
chromosome, as disclosed herein, producing a recombinant stabilized artificial 
chromosome. In one embodiment, multiple integration sites of the same type are 

30 present on the artificial chromosome thereby increasing the chance that one or more of 
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the integration sites remain present on the artificial chromosome after the 
fragmentation occurs. The recombinant stabilized chromosome can then be introduced 
into an avian embryo to produce a line of avians containing in their genome a 
stabilized artificial chromosome comprising a transgene. 
5 Any useful method may be used to recover the stabilized artificial 

chromosomes from the bird. For example, intact chromosomes can be prepared from 
blood cells of the birds as is known in the art, for example, essentially as disclosed in 
de Jong et al. Cytometry 35: 129-133 (1999) and Griffm et al. Cytogenet. Cell Genet. 
87: 278-28 1 ( 1999). After preparation, the chromosomes are flow sorted to isolate the 

10 stabilized chromosome. In one embodiment, the stabilized artificial chromosome is of 
a size that does not allow it to be easily distinguished from the debris field in the 
cytometry histogram. In this instance sequence specific marker dyes can be used to tag 
the slahilized artificial chromosome thereby facilitating its puril'icalion. In one usel'ul 
embodiment, polyamide probes can be used to tag the stabilized artificial chromosome 

15 as disclosed herein. 

Typically, the artificial chromosomes introduced into avians are stably 
maintained in the avians and are passed to offspring through the germline. In addition, 
artificial chromosomes can be stably maintained in avian ceU lines such as chicken cell 
line (DT-40). 

20 The invention is also useful for visualizing gene activity in avian cells as is 

understood by a practitioner of ordinary skiU in the art (See, for example, Tsukamoto, 
et al (2000) Nature Cell Biology, 2:871-878). 

Most non-viral methods of gene transfer rely on normal mechanisms used by 
eukaryotic cells for the uptake and intracellular transport of macromolecules. In 

25 certain useful embodiments, non-viral gene delivery systems of the present invention 
rely on endocytic pathways for the uptake of the subject transcriptional regulatory 
region and operably linked polypeptide-encoding nucleic acid by the targeted cell. 
Exemplary gene delivery systems of this type include liposomal derived systems, poly- 
lysine conjugates, and artificial viral envelopes. Modified chromosomes as described 
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above may be delivered to isolated avian embryonic cells for subsequent introduction 
to an embryo. 

In a representative embodiment, a nucleic acid molecule can be entrapped in 
liposomes bearing positive charges on their surface (e.g., lipofectins) and (optionally) 
5 which are tagged with antibodies against cell surface antigens of the target tissue 
(Mizuno et al, 1992, NO Shinkei Geka 20: 547-551; PCX publication W09 1/06309, 
published May 16, 1991; Japanese patent application 1047381, published February 21, 
1989; and European patent publication EP-A-43075, published January 6, 1982, all of 
which arc incorporated herein by reference in their entireties). 

10 Li similar fashion, the gene delivery system can comprise an antibody or cell 

surface ligand that is cross-Unked with a gene binding agent such as polylysine (see, 
for example, PCT publications WO93/04701, published March 18, 1993; 
W092/22635, published JJecember 23, 1992; WO92/20316, published November 26, 
1992; W092/19749, published November 12, 1992; and WO92/06180, pubHshed 

15 April 16, 1992, the disclosures of which are incorporated herein by reference in their 
entireties). It will also be appreciated that effective delivery of the subject nucleic acid 
constructs via receptor-mediated endocytosis can be improved using agents which 
enhance escape of genes from the endosomal structures. For instance, whole 
adenovirus or fusogenic peptides of the influenza HA gene product can be used as part 

20 of the delivery system to induce efficient disruption of DNA-containing endosomes 
(Mulligan et al, 1993, Science 260:926-932; Wagner et al, 1992, Proc. Natl. Acad. Sci. 
89:7934-7938; and Christiano ct al, 1993, Proc. NaLi. Acad. Sci. 90:2122-2126, all of 
which are incorporated herein by reference in their entireties). It is further 
contemplated that a recombinant nucleic acid molecule of the present invention may 

25 be delivered to a target host cell by other non-viral methods including by gene gun, 
microinjection, sperm-mediated transfer, or the like. 

In one embodiment of the invention, an expression vector that comprises a 
recombination site, such as an attB site, and a region encoding a polypeptide deposited 
into an egg white are delivered to oviduct cells by in vivo electroporation. In this 

30 method, the luminal surface of an avian oviduct is surgically exposed. A buffered 
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solution of the expression vector and a source of integrase activity such as a second 
expression vector expressing integrase (for example, pCMV-int) is deposited on the 
luminal surface. Electroporation electrodes are then positioned on either side of the 
oviduct wall, the luminal electrode contacting the expression vector solution. After 
5 electroporation, the surgical incisions are closed. The electroporation will deliver the 
expression vectors to some, if not all, treated recipient oviduct cells to create a tissue- 
specific chimeric animal. Expression of the integrase allows for the integration of the 
heterologous polynucleotide into the genome of recipient oviduct cells. While this 
method may be used with any bird, a useful recipient is a chicken due to the size of the 

10 oviduct. Also useful is a transgenic bird that has a transgenic attP recombinant site in 
the nuclear genomes of recipient oviduct cells, thus increasing the efficiency of 
integration of the expression vector. 

The attB/P integrase system is useful in the in vivo electroporation method to 
allow the formation of stable gcnelically transformed oviduct cells that otherwise 

15 progressively lose the heterologous expression vector. 

The stably modified oviduct cells will express the heterologous polynucleotide 
and deposit the resulting polypeptide into the egg white of a laid egg. For this 
purpose, the expression vector will further comprise an oviduct- specific promoter such 
as ovalbumin or ovomucoid operably linked to the desired heterologous 

20 polynucleotide. 

Another aspect of the invention is the generation of a trisomic or 
transchromosomic avian cell comprising a genetically modified extra chromosome. 
The extra chromosome may be an artificial chromosome or an isolated avian 
chromosome that has been genetically modified. Introduction of the extra 

25 chromosome to an avian cell will generate a trisomic or transchromosomic cell with 
2n+l chromosomes, where n is the haploid number of chromosomes of a normal avian 
cell. 

Dehvery of an isolated chromosome into an isolated avian cell or embryo can 
be accomplished in several ways. Isolated mitotic chromosomes or a micronucleus 
30 containing an interphase chromosome can be injected into early stage I embryos by 
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cytoplasmic injection. The injected zygote would then be surgically transferred to a 
recipient hen for the production and laying of a hard shell egg. This hard shell egg 
would then be incubated until hatching of a chick. 

In one embodiment, isolated microcells which contain the artificial 
5 chromosome can be fused to primordial germ cells (PGCs) isolated from the blood 
stream of late stage 15 embryos as described by Killary & Foumier in Methods 
Enzymol. 254: 133-152 (1995). The PGC/microcell hybrids can then be transplanted 
into the blood stream of a recipient embryo to produce germline chimeric chickens. 
(SeeNailo el al (1994) Mol. Reprod. Dev. 39: 153-161). The manipulated eggs would 

10 then be incubated until hatching of the bird. 

Blastodermal cells isolated from stage X embryos can be transfected with 
isolated mitotic chromosomes. Following in vitro transfection, the ceUs are 
transplanted back into stage X embryos as described, for example, in Etches et al, 
Poult. Sci., 72: 882-889 (1993), and the manipulated eggs are incubated to hatching. 

15 Stage X blastodermal cells can also be fused with isolated microcells and then 

transplanted back into to stage X embryos or fused lo somatic cells to be used as 
nuclear donors for nuclear transfer as described by Kuroiwa et al, Nat. Biotechnol. 20: 
889-894 (2002). 

Chromosomal vectors, as described above, may be delivered to a recipient 
20 avian cell by, for example, microinjection, liposomal delivery or microcell fusion. 

In the methods of the invention, a site-specific integrase is introduced into an 
avian cell whose genome is to be modified. Methods of introducing functional proteins 
into cells are well known in the art. Introduction of purified integrase protein can 
ensure a transient presence of the protein and its activity. Thus, the lack of 
25 permanence associated with most expression vectors is not expected to be detrimental. 

The integrase used in the practice of the present invention can be introduced 
into a target cell before, concurrently with, or after the introduction of a site-specific 
vector. The integrase can be directly introduced into a cell as a protein, for example, 
by using liposomes, coated particles, or microinjection, or into the blastodermal layer 
30 of an early stage avian embryo by microinjection. A source of the integrase can also 



76 



wo 2006/093847 



PCT/US2006/006752 



be delivered to an avian cell by introducing to the cell an mRNA encoding the 
integrase and which can be expressed in the recipient cell as an integrase polypeptide. 
Alternately, a DNA molecule encoding the integrase can be introduced into the cell 
using a suitable expression vector. 
5 The present invention provides novel nucleic acid vectors and methods of use 

that allow integrases, such as phiC3 1 integrase, to efficiently integrate a heterologous 
nucleic acid into a vertebrate animal genome, for example, an avian genome. A novel 
finding is that the phiC3 1 integrase is remarkably efficient in avian cells and increases 
the rate of integration of heterologous nucleic acid at least 30-fold over that of random 

10 integration. Furthermore, the phiC31 integrase works equally well at 37°C and 

indicating that it will function in the environment of the developing avian embryo, as 
shown in Example 1. 

It is important to note that the present invention is not bound by any 
mechanism or theory of operation. For example, the mechanism by which integrase, 

15 or any other substance described herein, facilitates transgenesis is unimportant. 
Integrase, for example, may facilitate transgenesis by mediating the integration of 
DNA into the genome of a recipient cell or integrase may facilitate transgenesis by 
facilitating the entry of the DNA into the cell or integrase may facilitate transgenesis 
by some other mechanism. 

20 The site- specific vector components described above are useful in the 

construction of expression cassettes containing sequences encoding an integrase. One 
integrase-expressing vector useful in the methods of the invention is pCMV-C31int 
(SEQ ID NO: 1 as shown in Fig. 9) where the phi(B 1 integrase is encoded by a region 
under the expression control of the strong CMV promoter. Another useful promoter is 

25 the RSV promoter as used in SEQ ID NO: 9 shown in Fig. 17. Expression of the 
integrase is typically desired to be transient. Accordingly, vectors providing transient 
expression of the integrase are useful. However, expression of the integrase can be 
regulated in other ways, for example, by placing the expression of the integrase under 
the control of a regulatable promoter (i.e., a promoter whose expression can be 

30 selectively induced or repressed). 
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Delivery of the nucleic acids introduced into cells, for example, embryonic 
cells (e.g., avian cells), using methods of the invention may also be enhanced by 
mixing the nucleic acid to be introduced with a nuclear locaUzation signal (NLS) 
peptide prior to introduction, for example, microinjection, of the nucleic acid. Nuclear 
5 localization signal (NLS) sequences are a class of short amino acid sequences which 
may be exploited for cellular import of linked cargo into a nucleus. The present 
invention envisions the use of any useful NLS peptide, including but not limited to, the 
NLS peptide of SV40 virus T-antigcn. 

An NLS of the invention is an amino acid sequence which mediates nuclear 

10 transport into the nucleus, wherein deletion of the NLS reduces transport into the 
nucleus. In certain embodiments, an NLS is a cationic peptide, for example, a highly 
cationic peptide. The present invention includes the use of any NLS sequence, 
including but not limited to, SV40 virus T-antigen. NLSs known in the art include, 
but are not limited to those discussed in Cokol et al, 2000, EMBO Reports, 1(5):411- 

15 415, Boulikas, T., 1993, Crit. Rev. liukaryot. Gene Expr., 3:193-227, Collas, P. et al, 
1996, Transgenic Research, 5: 451-458, Collas and Alestrom, 1997, Biochem. C^cll 
Biol. 75: 633-640, Collas and Alestrom, 1998, Transgenic Research, 7: 303-309, 
Collas and Alestrom, Mol. Reprod. Devel., 1996, 45:431-438. The disclosure of each 
of these references is incorporated by reference herein in its entirety. 

20 Not to be bound by any mechanism of operation, DNA is protected and hence 

stabilized by cationic polymers. The stability of DNA molecules in the cytoplasm of 
cells may be increased by mixing the DNA to be introduced, for example, 
microinjected with cationic polymers (for example, branched cationic polymers), such 
as polyethylenimine (PEL), polylysine, DEAE-dextran, starburst dendrimers, starburst 

25 polyamidoamine dendrimers, and other materials that package and condense the DNA 
molecules (Kukowska-Latallo et al, 1996, Proc. Natl. Acad. Sci. USA 93:4897-4902). 

Once the DNA molecules are delivered to the cytoplasm of cells, they migrate 
into the cell's endocytotic vesicles. Furthermore, migration into the cell's endosome is 
followed by fast inactivation of DNA within the endolysosomal compartment in 

30 transfcctcd or injected cells, both in vitro and in vivo (Godbey, W, ct al 1999, Proc 
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Natl Acad Sci U S A 96: 5177-5181; and Lechardeur, D^et al 1999, Gene Ther 6: 482- 
497; and references cited therein). Accordingly, in certain embodiments, DNA uptake 
is enhanced by the receptor-mediated endocytosis pathway using transferrin-polylysine 
conjugates or adenoviral-mediated vesicle disruption to effect the release of DNA 
5 from endosomes. However, the invention is not limited to this or any other theory or 
mechanism of operation referred to herein. 

Buffering the endosomal pH using endosomal-scaping elements also protects 
DNA from degradation (Kircheis, R, et al 2001, Adv Drug Deliv Rev 53: 341-358 ; 
Boussif, O, et al 1995, Proc Natl Acad Sci U S A 92: 7297-7301; and PoUard, H, et al 

10 1998, J Biol Chem 273: 7507-7511; and references cited therein). Thus, in certain 
embodiments, DNA complexes are delivered with polycations or cationic polymers 
that possess substantial buffering capacity below physiological pH, such as 
polyethylenimine, lipopolyamines and polyamidoamine polymers. In certain 
embodiments, DNA condensing compounds, such as the ones described above, are 

15 combined with viruses (Curiel, D, et al Proc Natl Acad Sci U S A 88: 8850-8854, 
1991; Wagner, E, et al Proc Natl Acad Sci U S A 89: 6099-6103, 1992 and Gotten, M, 
et al, 1992, Proc Nad Acad Sci U S A 89: 6094-6098), viral peptides (Wagner, E, et al 
1992, Proc Natl Acad Sci U S A 89: 7934-7938; Plank, C, et al 1994, J Biol Chem 
269: 12918-12924) and subunits of toxins (LJherek, C, et al, 1998, J Biol Chem 273: 

20 8835-48). These materials significantly enhance the release of DNA from endosomes. 
In certain embodiments, viruses, viral peptides, toxins or subunits of toxins may be 
coupled to DNA/polylysine complexes via biochemical means or specifically by a 
strcptavidin-biotin bridge (Wagner et al, 1992, Proc. Natl. Acad. Sci. USA 89:6099- 
6103; Plank et al, 1994, J. Biol Chem. 269(17): 12918-12924). In other certain 

25 embodiments, the virus that is complexed with the DNA may be adenovirus, 
retrovirus, vaccinia virus, or parvovirus. The viruses may be linked to PEI or another 
cationic polymer associated with the nucleic acid, hi certain embodiments, the virus 
may be alphavirus, orthomyxovirus, or picornavirus. In certain embodiments, the 
virus is defective or chemically inactivated. The virus may be inactivated by short- 

30 wave UV radiation or the DNA intercalator psoralen plus long- wave UV. The 
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adenovirus may be coupled to polylysine, either enzymatically through the action of 
transglutaminase or biochemically by biotinylating adenovirus and streptavidinylating 
the polylysine moiety. Transferrin may also be useful in combination with cationic 
polymers, adenoviruses and/or other materials disclosed herein to produce transgenic 
5 avians. For example, DNA complexes containing PEl^ PEl-modified transferrin, and 
PEI-bound influenza peptides may be used to enhance transgenic avian production. 

In other certain embodiments, complexes containing plasmid DNA, transferrin- 
PRT conjugates, and PRI-conjugated peptides derived from the N-terminal sequence of 
the influenza virus hemagglutinin subunit HA-2 may be used to produce transgenic 

10 chickens. In certain embodiments, the PEI-conjugated peptide may be an amino- 
terminal amino acid sequence of influenza virus hemagglutinin which may be 
elongated by an amphipathic hehx or by carboxyl-terminal dimerization. 

The present invention provides for methods of dispersing or distributing 
nucleic acid in a cell, for example, in an avian cell. The avian cell may be, for 

15 example, and without limitation, a cell of a stage 1 avian embryo, a cell of a stage II 
avian embryo, a cell of a stage III avian embryo, a cell of a stage IV avian embryo, a 
cell of a stage V avian embryo, a cell of a stage VI avian embryo, a cell of a stage VTI 
avian embryo, a cell of a stage VIII avian embryo, a cell of a stage DC avian embryo, a 
cell of a stage X avian embryo, a cell of a stage XI avian embryo or a cell of a stage 

20 Xn avian embryo. In one particularly useful embodiment, the avian cell is a ceU of a 
stage X avian embryo. 

In one aspect of the present invention, cationic polymers are useful to 
distribute, for example, homogeneously distribute, nucleic acid introduced into a cell, 
for example, an embryonic avian cell. The present invention contemplates the use of 

25 cationic polymers including, but not limited to, those disclosed herein. 

However, substances other than cationic polymers also capable of distributing 
or dispersing nucleic acids in a cell are included within the scope of the present 
invention. 

The concentration of cationic polymer used is not critical though, in one useful 
30 embodiment, enough cationic polymer is present to coat the nucleic acid to be 
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introduced into the avian cell. The cationic polymer may be present in an aqueous 
mixture with the nucleic acid to be introduced into the cell at a concentration in a 
range of an amount equal to about the weight of the nucleic acid to a concentration 
wherein the solution is saturated with cationic polymer. In one useful embodiment, 
5 the cationic polymer is present in an amount in a range of about 0.01% to about 50 %, 
for example, about 0.1% to about 20% (e.g., about 5%). The molecular weights of the 
cationic polymers can range from a molecular weight of about 1,000 to a molecular 
weight of about 1,000,000. In one embodiment, the molecular weight of the cationic 
polymers range from about 5,000 to about 100,000 for example, about 20,000 to about 
10 30,000. 

In one particularly useful aspect of the invention, procedures that are effective 
to facilitate the production of a transgenic avian may be combined to provide for an 
enhanced production of a transgenic avian wherein the enhanced production is an 
improved production of a transgenic avian relative to the production of a transgenic 

15 avian by only one of the procedures employed in the combination. For example, one 
or more of integrase activity, NLS, cationic polymer or other technique useful to 
enhance transgenic avian production disclosed herein can be used in the same 
procedure to provide for an enhanced production of transgenic avians relative to an 
identical procedure which does not employ all of the same techniques useful to 

20 enhance transgenic avian production. 

Another aspect of the present invention is a vertebrate animal ceU which has 
been genetically modified with a transgene vector according to the present invention 
and as described herein. For example, in one embodiment, the transformed cell can be 
a chicken early stage blastodermal cell or a genetically transformed cell line, including 

25 a sustainable cell line. The transfected cell according to the present invention may 
comprise a transgene stably integrated into the nuclear genome of the recipient cell, 
thereby replicating with the cell so that each progeny ceU receives a copy of the 
transfected nucleic acid. A particularly useful cell line for the delivery and integration 
of a transgene comprises a heterologous attP site that can increase the efficiency of 
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integration of a polynucleotide by phiC31 integrase and, optionally, a region for 
expressing the integrase. 

A retroviral vector can be used to deliver a recombination site such as an att 
site into the cellular genomes, such as avian genomes, since an attP or attB site is less 
5 than 300 bp. For example, the attP site can be inserted into the NLB retroviral vector, 
which is based on the avian leukosis virus genome. A lentiviral vector is a particularly 
suitable vector because lentiviral vectors can transduce non-dividing cells, so that a 
higher percentage of cells will have an integrated attP site. 

The lacZ region of NLB is replaced by the attP sequence. A producer cell line 

10 would be created by transformation of, for example, the Isolde cell line capable of 
producing a packaged recombinant NLB-attP virus pseudo-typed with the envA 
envelope protein. Supernatant from the Isolde NLB-attP line is concentrated by 
centrifugation to produce high titer preparations of the retroviral vector that can then 
be used lo deUver the attP site to the genome of a cell, for example, as described in 

15 Example 9 below. 

In one embodiment, an attP-containing line of transgenic birds are a source of 
attP transgenic embryos and embryonic cells. Fertile zygotes and oocytes bearing a 
heterologous attP site in either the maternal, paternal, or both, genomes can be used for 
transgenic insertion of a desired heterologous polynucleotide. A transgene vector 

20 bearing an attB site, for example, would be injected into the cytoplasm along with 
either an integrase expression plasmid, mRNA encoding the integrase or the purified 
integrase protein. The oocyte or zygote is then cultured to hatch by ex ovo methods or 
reintroduced into a recipient hen such that the hen lays a hard shell egg the next day 
containing the injected egg. 

25 In another example, fertile stage I to Xll embryos, for example, stage Vll to XII 

embryos, hemizygous or homozygous for the heterologous integration site, for 
example, the attP sequence, may be used as a source of blastodermal cells. The cells 
are harvested and then transfected with a transgene vector bearing a second 
recombination site, such as an attB site, plus a nucleotide sequence of interest along 

30 with a source of integrase. The transfected cells are then injected into the subgerminal 
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cavity of windowed fertile eggs. The chicks that hatch will bear the nucleotide 
sequence of interest and the second integration site integrated into the attP site in a 
percentage of their somatic and germ cells. To obtain fully transgenic birds, chicks are 
raised to sexual maturity and those that are positive for the transgene in their semen 
5 are bred to non-transgenic mates. As disclosed herein, in certain embodiments, the 
cells of the invention, e.g., embryos, may include an integrase which specifically 
recognizes recombination sites and which is introduced into cells containing a nucleic 
acid construct of the invention under conditions such that the nucleic acid scqucncc(s) 
of interest wiU be inserted into the nuclear genome. Methods for introducing such an 

10 integrase into a cell are described herein. In some embodiments, the site-specific 
integrase is introduced into the cell as a polypeptide. In alternative embodiments, the 
site-specific integrase is introduced into the transgenic cell as a polynucleotide 
encoding the integrase, such as an expression cassette optionally carried on a transient 
expression vector, and comprising a polynucleotide encoding the recombinase. 

15 In one embodiment, the invention is directed to methods of using a vector for 

site-specific integration of a heterologous nucleotide sequence into the genome of a 
cell, the vector comprising a circular backbone vector, a polynucleotide of interest 
operably linked to a promoter, and a first recombination site, wherein the genome of 
the cell comprises a second recombination site and recombination between the first 

20 and second recombination sites is facilitated by an integrase. In certain embodiments, 
the integrase facilitates recombination between a bacterial genomic recombination site 
(attB) and a phage genomic recombination site (attP). 

In another embodiment, the invention is directed to a cell having a transformed 
genome comprising an integrated heterologous polynucleotide of interest whose 

25 integration, mediated by an integrase, was into a recombination site native to the cell 
genome and the integration created a recombination-product site comprising the 
polynucleotide sequence. In yet another embodiment, integration of the 
polynucleotide was into a recombination site not native to the cell genome, but instead 
into a heterologous recombination site engineered into the cell genome. 

3D In further embodiments, the invention is directed to transgenic vertebrate 
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animals, such as transgenic birds, comprising a modified cell and progeny thereof as 
described above, as weU as methods of producing the same. 

For example, cells genetically modified to carry a heterologous attB or attP site 
by the methods of the present invention can be maintained under conditions that, for 
5 example, keep them alive but do not promote growth and/or cause the cells to 
differentiate or dedifferentiate. Cell culture conditions may be permissive for the 
action of the integrase in the cells, although regulation of the activity of the integrase 
may also be modulated by culture conditions (e.g., raising or lowering the temperature 
at which the ceUs are cultured). 

10 The present invention also provides for methods of purifying artificial 

chromosomes. In one significant embodiment of the invention, the purified artificial 
chromosomes are used to produce transchromosomic animals including, but not 
limited to, transchromosomic avians (e.g., transchromosomic chickens). Any useful 
type of artificial chromosome is conlemplaled for use in the present invention. 

15 In one aspect, the present invention is directed to purifying artificial 

chromosomes useful in producing transgenic avians (e.g., chickens) by tagging the 
chromosomes with a marker dye, for example, and without limitation, a fluorescent 
marker dye. In one particularly useful aspect of the invention, sequence specific 
polyamide probes are used in the tagging process. The tagged chromosomes may be 

20 purified by methods which provide for the discrimination of the tagged chromosomes 
over untagged chromosomes, such as flow cytometry. 

For example, the method of Gygi et al (2002) Nucleic Acids Res. 30: 2790- 
2799, the disclosure of which is incorporated by reference herein in its entirety, is 
contemplated for use in the present invention. Briefly, the protocol provides for the 

25 use of synthetic polyamide probes to fluorescently label regions on the artificial 
chromosomes (e.g., heterochromatin in the case of SATACs) which are then isolated 
by flow cytometry. The polyamides may bind to the minor groove of DNA of the 
chromosomes in a sequence specific manner without the need to disrupt the 
chromosome (e.g., denature the DNA). 
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Any useful region (e.g., nucleotide sequence or sequences) of the artificial 
chromosome to be purified can be tagged using probes as disclosed herein. For 
example, any sequence present in the artificial chromosome to be purified and not 
present, or present to a lesser degree, in one or more chromosomes naturally occurring 
5 in the host cell may be tagged using a probe. For example, telomeric regions, 
centromeric regions, non-coding regions and/or coding regions of the artificial 
chromosome may be targeted by the probes thereby tagging the artificial chromosome. 
For example, the hctcrochromatic region of SA'l'ACs can be targeted for tagging since 
SATACs (or megachromosomes) (see, for example, US Patent No. 6,077,697, issued 

10 June 20, 2000) are comprised primarily of heterochromatic DNA (e.g., repeat 
sequences of the mouse major satellite DNA sequences). For example, fiuorescent in 
situ hybridization utiUzing probes designed to recognize mouse major satellite 
sequences produce an intense fluorescent signal throughout the length of the 
heterochromatic region of SATACs. The signal has been shown not to be present in 

15 background chromosomes of non-mouse cell lines such as ChYl cells showing that 
cell lines such as ChYl are useful to host the artificial chromosome when targeting the 
artificial chromosome for tagging (e.g., tagging with labeled probes). 

The invention contemplates the purification of any artificial chromosome 
useful for the production of transchromosomic animals (e.g., transchromosomic 

20 avians) as disclosed herein. For example, artificial chromosomes and methods related 
thereto such as those disclosed in U S Patent No. 6,025,155 issued February 15, 2000; 
U S Patent No. 6,743,967 issued .Tune 1, 2004; U S Patent No. 6,077,697 issued .Tune 
20, 2000; U S Patent No. 5,288,625 issued February 22, 1994; U S Patent No. 
5,721,118 issued February 24, 1998; U S Patent No. 6,133,503 issued October 17, 

25 2000; US patent publication 2003/0113917 published June 19, 2003; US patent 
pubhcation 2003/0003435 pubUshed January 2, 2003; WO 95/32297, International 
Publication Date November 30, 1995 are contemplated for use in accordance with the 
present invention. The disclosures of each of these six US patents, two pubUshed US 
patent appUcations and one published WO patent apphcation are incorporated in their 

30 entirety herein by reference. In addition, the following publications disclose artificial 
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chromosomes or methods related thereto which are contemplated for use in the present 
invention. Each of these publications is incorporated herein in its entirety by 
reference: Bower, "Constructing a fuUy defined human minichromosome: Cloning a 
centromere" (1987) Proc. 4th Eur. Congress Biotechnol. 3:571; Carine, et al, "Chinese 
5 hamster cells with a minichromosome containing centromere region of human 
chromosome 1"(1986) Somatic Cell Molec.Genet. 12:479 — 491; Carine, et al., 
"Molecular characterization of human minichromosomes with centromere from 
chromosome 1 in hamster — human hybrids"(1989) Somatic Cell Molec. Genet. 
15(15):445 — 460; Pair, et al. "Generation of a human X — derived minichromosome 

10 using telomere — associated chromosome fragmentation" (1995) EMBO J. 14:5444 — 
5454; Hadlaczky, et al, Centromere formation in mouse cells cotransformed with 
human DNA and a dominant marker gene (1991) Proc. Natl. Acad. Sci.USA, 88:8106- 
8 1 10; Hadlaczky, et al, Satellite DNA-bascd artificial chromosomes for use in gene 
therapy (2001) Curr. Opin. Mol. Ther., Apr. 3(2): 125-32; Hadlaczky and Szalay, 

15 "Mammalian artificial chromoscjmes: Potential vectors for gene therapy" (1996) 
Abstract from International Symposium on Gene Therapy of Cancer, AIDS and 
Genetic Disorders, Trieste (Italy) (Apr, 10 — 13); Hadlaczky and Szalay, "Mammalian 
artificial chromosomes: Introduction of novel genes into mammalian artificial 
chromosomes"(1996) Abstract from International Symposium on Gene Therapy of 

20 Cancers AIDS and Genetic Disorders, Trieste (Italy) (Apr. 10 — 13,); Harrington et al., 
Formation of de novo centromeres and construction of first- generation human artificial 
microchromosomes (1997) Nature Genetics, 15: (4) 345-355; HcUcr ct al. Mini — 
chromosomes derived from the human Y chromosome by telomere directed 
chromosome breakage (1996) Proc. Natl. Acad. Sci. USA, 93:7125—7130; Huxley, 

25 "Mammalian artificial chromosomes; a new tool for gene therapy" (1994) Gene 
Therapy, 1:7 — 12; Huxley, C, Mammalian artificial chromosomes and chromosome 
transgenics (1997) Trends Genet., Sep;13(9):345-7; KeresS, et al, De novo 
chromosome formations by large-scale ampUfication of the centromeric region of 
mouse chromosomes. Chromosome Res. (1996) Apr; 4(3):226-239; Katoh, et al, 

30 Construction of a novel human artificial chromosome vector for gene delivery (2004) 
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Biochem. Biophys. Res. CommuH'^Aug. 20;321(2):280-290; Larin, et al. Advances in 
human artificial chromosome technology (2002) Trends Genet., Jun;18 (6):313-9; 
Lipps, et al, Chromosome-based vectors for gene therapy (2003) Gene, 304:23-33; 
Mills, et al, Generation of an -2.4 Mb human X centromere-based minichromosome 
5 by targeted telomere-associated chromosome fragmentation in DT40 (1999) Human 
Molecular Genetics, 8(5) 751-761; Murray, et al., "Construction of artificial 
chromosomes in yeast" (1983) Nature 305:189 — 193; Masumoto, Structural and 
functional analyses of the centromere of human chromosome 21: construction of 
human artificial chromosomes (2001) Tanpakushitsu Kakusan Koso., Dec;46(16 

10 Suppl):2375-8; Praznovszky, et al, De novo chromosome formation in rodent cells 
(1991) Proc. Natl. Acad. Sci.USA, 88:11042-11046; Raimondi, et al., "X— ray 
mediated size reduction, molecular characterization and transfer in model systems of a 
human artificial minichromosome" (1996) Abstract from International Symposium on 
Gene I'herapy of Cancer, AIDS and Genetic Disorders, Trieste (Italy) Apr. 10 — 13; 

15 Robi, et al.. Artificial chromosome vectors and expression of complex proteins in 
transgenic animals (2003) Thcriogcnology, Jan 1;59(1): 107-13; I'clenius, et al, 
StabiUty of a functional murine satelUte DNA-based artificial chromosome across 
mammahan species (1999) Chromosome Research, 7:3-7; and Wang, et al, Expression 
of a Reporter Gene After Microinjection of Mammalian Artificial Chromosomes into 

20 Pronuclei of Bovine Zygotes (200 1) Mol. Reprod. and Dev. 60:433-43 8. 

Li one useful embodiment of the invention, labeled (e.g., fluorescently labeled) 
polyamide probes are employed to tag the artificial chromosomes to facilitate 
purification (e.g., flow cytometry based purification) of the artificial chromosome. 
Polyamide probes may be prepared by any useful method known in the art such as 

25 those methods disclosed in: PC17US97/12733; PCiyUS97/03332; PC;r/US97/12722; 
PCT/LTS98/06997; PCT/LIS98/02444; PCT/LTS98/02684; PCT/LTS98/01006; 
PCT/US98/03829; PCT/US98/0714 and US Patent No. 6,673,940, issued January 6, 
2004. The disclosures of each of these nine PCT appUcations and one issued patent 
are incorporated in their entirety herein by reference. Other useful methods included 

30 in the following references are contemplated for use in accordance with the present 
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invention: US Patent No. 6,673,940, issued January 6, 2004; US Patent No. 6,555,692, 
issued April 29, 2003; US Patent No. 6,506,906, issued January 14, 2003; US Patent 
No. 6,472,537, issued October 29, 2002; US Patent No. 6,432,638, issued August 13, 
2002; US Patent No. 6,303,312, issued October 16, 2001; and US Patent No. 
5 6,143,901, issued November 7, 2000. The disclosures of each of these seven issued 
patents are incorporated in their entirety herein by reference. In addition, the 

following five publications, the disclosures of which are incorporated in their entirety 
herein by reference, disclose compositions and methods which are contemplated for 
use in the present invention: Dervan, Molecular Recognition of DNA by Small 

10 Molecules (2001) Bioorganic & Medicinal Chem. 9; 2215-2235; Gygi, et al., Use of 
Fluorescent Sequence-Specific Polyamides to Discriminate Human Chromosomes by 
Microscopy and Flow Cytometry (2002) Nucleic Acids Research; 30: (13) 2790-2799; 
Rucker, et al., vSequence Specific Fluorescence Detection of Double Strand DNA 
(2003); J. Am.Chem. Soc; 125: 1195-1202; Recognition of the DNA minor groove by 

15 pyrrole-imidazole polyamides (2003) Curr Opin Struct Biol 13: 284-99; and J Am 
Chem Soc (2003) 125: 1195-202. 

Wade, et al. (J. Am. Chem. Soc, vol. 114:8783-8794 (1992)) reported the 
design of polyamides that bind in the minor groove of dsDNA at 5'- 
(A,T)G(A,T)C(A,T)-3' sequences by a dimeric, side-by-side motif. Mrksich, et al. 

20 (Proc. Natl. Acad. Sci. USA, vol. 89:7586-7590 (1992)), reported an antiparallel, side- 
by-side polyamide motif for sequence-specific recognition in the minor groove of 
dsDNA by the designed peptide l-methylimidazole-2-carboxamide netropsin. 
Trauger, et al. (Nature, vol. 382:559-561 (1996)) reports the recognition of a targeted 
dsDNA by a polyamide at subnanomolar concentrations. The disclosure of each of 

25 these three publications is incorporated by reference herein in its entirety. 

In one embodiment, the particular order of amino acids in polyamides useful 
for making labeled (e.g., fluorescently labeled) polyamide probes, and their pairing in 
dimeric, antiparallel complexes formed by association of two polyamide polymers can 
be used to determine the sequence of nucleotides in dsDNA with which the polymers 

30 preferably associate. 
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The development of pairing rules for minor groove binding polyamides derived 
from N-methylpyrrole (Py) and N-methylimidazole (Im) amino acids can provide a 
useful code to control target nucleotide base pair sequence specificity. Specifically, in 
one embodiment an Im/Py pair in adjacent polymers can distinguish G-C from C-G 
5 and both of these from A-T or T-A base pairs. A Py/Py pair can specify A-T from G- 
C but cannot distinguish A-T from T-A. White, et al. (Biochemistry, vol. 35:12532- 
12537 (1996), the disclosure of which is incorporated in its entirety herein by 
reference) reported the effects of the A-T/T-A degeneracy of Py/Im polyamide 
recognition in the minor groove of dsDNA. White, et al. (Chem. & Biol. vol. 4:569- 
10 578 (1997), the disclosure of which is incorporated in its entirety herein by reference) 
reported the pairing rules for recognition in the minor groove of dsDNA by Py/Im 
polyamides and the 5' to 3', N to C orientation preference for polyamide binding in the 
minor groove of dsDNA. 

The inclusion of an aromatic amino acid, such as 3-hydroxy-N-methylpyrrole 
15 (Hp)(made by replacing a single hydrogen atom in Py with a hydroxy group), in a 
polyamide and paired opposite Py enables A-T to be discriminated from T-A by an 
order of magnitude. Utilizing Hp together with Py and Im in polyamides provides a 
code to distinguish all four Watson-Crick base pairs (i.e., A-T, T-A, G-C, and C-G) in 
the minor groove of dsDNA, as foUows: 

20 

Pairing Code 

for Minor Groove Recognition 

Pair G-C C-G T-A A-T 

25 Im/Py + - - - 
Py/Im 

Hp/Py . . + - 

Py/Hp - - - + 

30 Favoredi(_|_) 

Disfavored (-) 

It is understood that the method of designing and making probes use as 
disclosed herein is unimportant and present invention is not limited to any particular 
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probe or to any particular polyamide probe or method of making such probe which is 
used to purify artificial chromosomes for the production of transchromosomic animals, 
such as transchromosomic avians as disclosed herein. 

One aspect of the invention are methods for generating a genetically modified 
5 cell for example, an avian cell, and progeny thereof, using a lagged chromosome. The 
methods may include providing an isolated modified chromosome comprising a lac 
operator region and a first recombination site, delivering the modified chromosome to 
an avian ceU, thereby generating a trisomic or transchromosomic avian cell, delivering 
to the avian cell a source of a tagged polypeptide comprising a fluorescent domain and 

10 a lac repressor domain, delivering a source of integrase activity to the avian cell, 
delivering a polynucleotide comprising a second recombination site and a region 
encoding a polypeptide to the avian cell, maintaining the avian cell under conditions 
suitable for the integrase to mediate recombination between the first and second 
recombination sites, thereby integrating the polynucleotide into the modified 

15 chromosome and generating a genetically modified avian cell, expressing the tag 
polypeptide by the avian cell, allowing the tag polypeptide to bind to the modified 
chromosome so as to label the modified chromosome, and isolating the modified 
chromosome by selecting modified chromosomes having a tag polypeptide bound 
thereto. 

20 In one embodiment of the invention, the second avian cell is selected from the 

group consisting of a stage VII-XII blastodermal ceU, a stage I embryo, a stage X 
embryo; an isolated primordial germ cell, an isolated non-embryonic cell, and an 
oviduct cell. 

In various embodiments, the isolated modified chromosome is an avian 
25 chromosome or an artificial chromosome. 

In other embodiments of the invention, the step of providing an isolated 
modified chromosome comprising a lac operator region and a first recombination site 
comprises the steps of generating a trisomic or tianschromosomic avian cell by 
delivering to an isolated avian cell an isolated chromosome and a polynucleotide 
30 comprising a lac operator and a second recombination site, maintaining the trisomic or 
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transchromosomic cell under conditions whereby the heterologous polynucleotide is 
integrated into the chromosome by homologous recombination, dehvering to the avian 
cell a source of a tag polypeptide to label the chromosome, and isolating the labeled 
chromosome. 

5 In one embodiment of the invention, the lac operator region is a concatamer of 

lac operators. In other embodiments of the invention, the tag polypeptide is expressed 
from an expression vector. 

In one embodiment of the invention, the tag polypeptide is microinjected into 
the cell. In various embodiments of the invention, the method of deUvery of a 
10 chromosome to an avian ceU is selected from the group consisting of liposome 
deUvery, microinjection, microcell, electroporation and gene gun deUvery, or a 
combination thereof. 

In embodiments of the invention, the fluorescent domain of the tag polypeptide 

is GFP. 

15 In one embodiment of the invention, the method further comprises the step of 

delivering the second avian cell to an avian embryo. The embryo may be maintained 
under conditions suitable for hatching as a chick. 

hi one embodiment of the invention, the second avian cell is maintained under 
conditions suitable for the proliferation of the cell, and progeny thereof. 

20 In various embodiments of the invention, the source of integrase activity is 

dehvcrcd to a first avian cell as a polypeptide or expressed from a polynucleotide, said 
polynucleotide being selected from an mRNA and an expression vector. 

In one embodiment of the invention, the tag polypeptide activity is delivered to 
the avian cell as a polypeptide or expressed from a polynucleotide operably linked to a 

25 promoter. In another embodiment of the invention, the promoter is an inducible 
promoter. In yet another embodiment of the invention, the integrase is phi(]31 
integrase and in various embodiments of the invention, the first and second 
recombination sites are selected from an attB and an attP site, but wherein the first and 
second sites are not identical. 
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Other aspects of the present invention include methods of expressing a 
heterologous polypeptide in vertebrate cells by stably transfecting cells using site- 
specific integrase-mediation and a recombinant nucleic acid molecule, as described 
herein, and culturing the transfected cells under conditions suitable for expression of 
5 the heterologous polypeptide. In addition, the present invention includes methods of 
expressing a heterologous polypeptide in a transgenic vertebrate animal by producing a 
transgenic vertebrate animal using methods known in the field or described herein in 
combination with using site-specific integration of nucleic acid molecules as described 
herein, and exposing the animal to conditions suitable for expression of the 

10 heterologous polypeptide. 

The protein of the present invention may be produced in purified form by any 
known conventional techniques. For example, in the case of heterologous protein 
production in eggs, the egg white may be homogenized and centrifuged. The 
supernatant may then be subjected to sequential ammonium sulfate precipitation and 

15 heat treatment. The fraction containing the protein of the present invention is 
subjected to gel filtration in an appropriately sized dextran or polyacrylamidc column 
to separate the proteins. If necessary, the protein fraction may be further purified by 
HPLC or other methods weU known in the art of protein purification. 

The methods of the invention are useful for expressing nucleic acid sequences 

20 that are optimized for expression in the host cells and which encode desired 
polypeptides or derivatives and fragments thereof. Derivatives include, for instance, 
polypeptides with conservative amino acid replacements, that is, those within a family 
of amino acids that are related in their side chains (commonly known as acidic, basic, 
nonpolar, and uncharged polar amino acids). Phenylalanine, tryptophan, and tyrosine 

25 are sometimes classified jointly as aromatic amino acids and other groupings are 
known in the art (see, for example, "Biochemistry", 2nd ed, L. Stryer, ed., W.H. 
Freeman & Co., 1981). Peptides in which more than one replacement has taken place 
can readily be tested for activity in the same manner as derivatives with a single 
replacement, using conventional polypeptide activity assays (e.g. for enzymatic or 

30 ligand binding activities) . 
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Regarding codon optimization, if the recombinant nucleic acid molecules are 
transfected into a recipient chicken cell, the sequence of the nucleic acid insert to be 
expressed can be optimized for chicken codon usage. This may be determined from 
the codon usage of at least one, or more than one, protein expressed in a chicken cell 
5 according to well known principles. For example, in the chicken the codon usage 
could be determined from the nucleic acid sequences encoding the proteins such as 
lysozyme, ovalbumin, ovomucin and ovotransferrin of chicken. Optimization of the 
sequence for codon usage can elevate the level of translation in avian eggs. 

The present invention provides methods for the production of a protein by cells 

10 comprising the steps of maintaining a cell, transfecting with a first expression vector 
and, optionally, a second expression vector, under conditions suitable for proliferation 
and/or gene expression and such that an integrase will mediate site specific 
recombination at art sites. The expression vectors may each have a transcription unit 
comprising a nucleotide sequence encoding a heterologous polypeptide, wherein one 

15 polypeptide is an integrase, a transcription promoter, and a transcriptional terminator. 
The cells may then be maintained under conditions for the expression and production 
of the desired heterologous polypeptidc(s). 

The present invention further relates to methods for gene expression by cells, 
such as avian cells, from nucleic acid vectors, and Iransgenes derived therefrom, that 

20 include more than one polypeptide-encoding region wherein, for example, a first 
polypeptide-encoding region can be operatively linked to an avian promoter and a 
second polypeptide-encoding region is operatively linked to an Intemal Ribosome 
Entry Sequence (IRES). It is contemplated that the first polypeptide-encoding region, 
the IREvS and the second polypeptide-encoding region of a recombinant DNA of the 

25 present invention may be arranged linearly, with the IRES operably positioned 
immediately 5' of the second polypeptide-encoding region, i'his nucleic acid construct 
can be used for the production of certain proteins in vertebrate animals or in their cells. 
For example, when inserted into the genome of an avian cell or a bird and expressed 
therein, wiU generate individual polypeptides that may be post-translationally modified 

30 and combined in the white of a hard shell bird egg. Alternatively, the expressed 
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polypeptides may be isolated from an avian egg and combined in vitro. 

The invention, therefore, includes methods for producing multimeric proteins 
including immunoglobulins, such as antibodies, and antigen binding fragments 
thereof. Thus, in one embodiment of the present invention, the multimeric protein is 
5 an immunoglobulin, wherein the first and second heterologous polypeptides are 
immunoglobulin heavy and light chains respectively. Illustrative examples of this and 
other aspects of the present invention for the production of heterologous multimeric 
polypeptides in avian cells are fully disclosed in U.S. Patent Application No. 
09/877,374, filed June 8, 2001, and U.S. Patent Application No. 10/251,364, filed 
10 September 18, 2002, both of which are incorporated herein by reference in their 
entirety. 

Accordingly, the invention further provides immunoglobulin and other 
multimeric proteins that have been produced by transgenic vertebrates including 
avians of the invention. 

15 In various embodiments, an immunoglobulin polypeptide encoded by the 

transcriptional unit of at least one expression vector may be an immunoglobulin heavy 
chain polypeptide comprising a variable region or a variant thereof, and may further 
comprise a D region, a J region, a C region, or a combination thereof. An 
immunoglobulin polypeptide encoded by an expression vector may also be an 

20 immunoglobulin light chain polypeptide comprising a variable region or a variant 
thereof, and may further comprise a J region and a C region. The present invention 
also contemplates multiple immunoglobulin regions that are derived from the same 
animal species, or a mixture of species including, but not only, human, mouse, rat, 
rabbit and chicken. In certain embodiments, the antibodies are human or humanized. 

25 In other embodiments, the immunoglobulin polypeptide encoded by at least 

one expression vector comprises an immunoglobulin heavy chain variable region, an 
immunoglobulin light chain variable region, and a linker peptide thereby forming a 
single-chain antibody capable of selectively binding an antigen. 

Examples of therapeutic antibodies that may be produced in methods of the 

30 invention include but are not limited to HERCHFllN''^ ('I'rastuzumab) (Genentech, 
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CA) which is a humanized anti-HER2 monoclonal antibody for the treatment of 
patients with metastatic breast cancer; REOPRO™ (abciximab) (Centocor) which is 
an anti-glycoprotein Ilb/IIIa receptor on the platelets for the prevention of clot 
formation; ZENAPAX™ (dachzumab) (Roche Pharmaceuticals, Switzerland) which is 
5 an immunosuppressive, humanized anti-CD25 monoclonal antibody for the prevention 
of acute renal allograft rejection; PANOREX^m which is a murine anti-17-IA cell 
surface antigen IgG2a antibody (Glaxo Wellcome/Centocor); BEC2 which is a murine 
anti-idiotype (GD3 epitope) TgG antibody (TmClone vSystem); IMC-C225 which is a 
chimeric anti-EGFR IgG antibody (ImClone System); VITAXIN™ which is a 

10 humanized anti-aVpS integrin antibody (Applied Molecular Evolution/Medlmmune); 
Campath lH/LDP-03 which is a humanized anti CD52 IgGl antibody (Leukosite); 
vSmart Ml 95 which is a humanized anti-CD33 IgG antibody (Protein Design 
Lab/Kanebo); RITUXAN™ which is a chimeric anti-CD20 IgGl antibody (IDEC 
Pharm/Genentech, Roche/Zettyaku); LYMPIIOGIDE™ which is a humanized anti- 

15 CD22 IgG antibody (Immunomedics); 1CM3 is a humanized anti-lCAM3 antibody 
(ICOS Pharm); IDEC-1 14 is a primate anti-CD80 antibody (IDEC Ph arm/Mitsubishi); 
ZEVAEINTM is a radiolabeled murine anti-CD20 antibody (IDEC/Schering AG); 
IDEC-131 is a humanized ana-CD40L anUbody (IDEC/Eisai); IDEC- 151 is a 
primatized anti-CD4 antibody (IDEC); IDEC-1 52 is a primatized anti-CD23 antibody 

20 (IDEC/Seikagaku); SMART anti-CD3 is a humanized anti-CD3 IgG (Protein Design 
Lab); 5G1 .i is a humanized anti-complement factor 5 (CS) antibody (Alexion Pharm); 
D2E7 is a humanized anti-TNF-a antibody (CATIBASF); CDP870 is a humanized 
anti-TNF-a Fab fragment (Celltech); IDEC-151 is a primatized anti-CD4 IgGl 
antibody (IDEC Pharm/SmithKline Beecham); MDX-CD4 is a human anti-CD4 IgG 

25 antibody (Medarex/Eisai/Genmab); CDP571 is a humanized anti-TNF-a IgG4 
antibody (Celltech); LDP-02 is a humanized anti-0e1-p7 antibody 
(LeukoSite/Genentech); OrthoClone OKT4A is a humanized anti-CD4 IgG antibody 
(Ortho Biotech); ANTOVA™ is a humanized anti-CD40L IgG antibody (Biogen); 
ANTEGREN"^!^ is a humanized anti-VLA-4 IgG antibody (Elan); and CAT- 152 is a 

30 human anti-TGF- antibody (Cambridge Ab Tech). 
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The invention can be used to express, in large yields and at low cost, a wide 
range of desired proteins including those used as human and animal pharmaceuticals, 
diagnostics, and livestock feed additives. Proteins such as fusion proteins, growth 
hormones, cytokines, structural proteins and enzymes including human growth 
5 hormone, interferon, lysozyme, and P-casein are examples of proteins which are 
desirably expressed in the oviduct and deposited in eggs according to the invention. 
Other possible proteins to be produced include, but are not limited to, albumin, a- 1 
antitrypsin, antithrombin HI, collagen, factors VIII, IX, X (and the like), fibrinogen, 
hyaluronic acid, insulin, lactoferrin, protein C, erythropoietin (EPO), granulocyte 

10 colony- stimulating factor (G-CSF), granulocyte macrophage colony- stimulating factor 
(GM-CSF), tissue-type plasminogen activator (tPA), feed additive enzymes, 
somatotropin, and chymotrypsin. Immunoglobulins (shown, for example in Example 
10 below) and genetically engineered antibodies, including immunotoxins which bind 
lo surl'ace antigens on human tumor cells and destroy them, can also be expressed for 

15 use as pharmaceuticals or diagnostics. 

Other specific examples of therapeutic proteins which are contemplated for 
production as disclosed herein include, with out limitation, factor VIII, b-domain 
deleted factor VIII, factor Vila, factor IX, anticoagulants; hirudin, alteplase, tpa, 
reteplase, tpa, tpa - 3 of 5 domains deleted, insulin, insulin lispro, insuUn aspart, 

20 insulin glargine, long-acting insuUn analogs, hgh, glucagons, tsh, foUitropin-beta, fsh, 
gm-csf, pdgh, ifn alpa2a, inf-apha, inf-beta lb, differs from h protein by cI7 to s, ifn- 
beta la, ifn-gammalb, il-2, il-11, hbsag, ospa, murine mab directed against t- 
lymphocyte antigen, murine mab directed against tag-72, tumor-associated 
glycoprotein, fab fragments derived from chimeric mab, directed against platelet 

25 surface receptor gpll(b)/lll(a), murine mab fragment directed against tumor-associated 
antigen cal25, murine mab fragment directed against human carcinoembryonic 
antigen, cea, murine mab fragment directed against human cardiac myosin, murine 
mab fragment directed against tumor surface antigen psma, murine mab fragments 
(fab/fab2 mix) directed against hmw-maa, murine mab fragment (fab) directed against 

30 carcinoma-associated antigen, mab fragments (fab) directed against nca 90, a surface 
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granulocyte nonspecific cross reacting antigen, chimeric mab directed against cd20 
antigen found on surface of b lymphocytes, humanized mab directed against the alpha 
chain of the il2 receptor, chimeric mab directed against the alpha chain of the 112 
receptor, chimeric mab directed against tnf-alpha, humanized mab directed against an 
5 epitope on the surface of respu^atory synctial virus, humanized mab directed against 
her 2, i.e., human epidermal growth factor receptor 2, human mab directed against 
cytokeratin tumor-associated antigen anti-ctla4, chimeric mab directed against cd 20 
surface antigen of b lymphocytes domase-alpha dnase, beta glucocerebrosidase, tnf- 
alpha, il-2-diptheria toxin fusion protein, tnfr-lgg fragment fusion protein laronidase, 

10 dnaases, alefacept, darbepoetin alfa (colony stimulating factor), tositumomab, murine 
mab, alemtuzumab, rasburicase, agalsidase beta, teriparatide, parathyroid hormone 
derivatives, adalimumab (Iggl), anakinra, biological modifier, nesiritide, human fa- 
type natriuretic peptide (hbnp), colony stimulating factors, pegvisomant, human 
growth hormone reccpltjr antagonist, recombinant activated protein c, omalizumab, 

15 immunoglobulin e (Ige) blocker and Ibrilumomab tiuxetan. 

In various embodiments of the transgenic vertebrate animal of the present 
invention, the expression of the transgene may be restricted to specific subsets of cells, 
tissues or developmental stages utilizing, for example, trans-acting factors acting on 
the transcriptional regulatory region operably linked to the polypeptide-encoding 

20 region of interest of the present invention and which control gene expression in the 
desired pattern. Tissue- specific regulatory sequences and conditional regulatory 
sequences can be used to control expression of the transgene in certain spatial patterns. 
Moreover, temporal patterns of expression can be provided by, for example, 
conditional recombination systems or prokaryotic transcriptional regulatory sequences. 

25 Another aspect of the present invention provides a method for the production 

of a heterologous protein capable of forming an antibody suitable for selectively 
binding an antigen. This method comprises a step of producing a transgenic vertebrate 
animal incorporating at least one transgene, the transgene encoding at least one 
heterologous polypeptide selected from an immunoglobulin heavy chain variable 

30 region, an immunoglobulin heavy chain comprising a variable region and a constant 
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region, an immunoglobulin light chain variable region, an immunoglobulin light chain 
comprising a variable region and a constant region, and a single-chain antibody 
comprising two pcptidc-linkcd immunoglobulin variable regions. 

In one embodiment of this method, the isolated heterologous protein is an 
5 antibody capable of selectively binding Lo an antigen and which may be generated by 
combining at least one immunoglobulin heavy chain variable region and at least one 
immunoglobulin light chain variable region, for example, cross-linked by at least one 
disulfide bridge. The combination of the two variable regions generates a binding site 
that binds an antigen using methods for antibody reconstitution that are well known in 
10 the art. 

The present invention also encompasses immunoglobulin heavy and light 
chains, or variants or derivatives thereof, to be expressed in separate transgenic avians, 
and thereafter isolated from separate media including serum or eggs, each isolate 
comprising one or more distinct species of immunoglobulin polypeptide. The method 

15 may further comprise the step of combining a plurality of isolated heterologous 
immunoglobulin polypeptides, thereby producing an antibody capable of selectively 
binding to an antigen. In this embodiment, for instance, two or more individual 
transgenic avians may be generated wherein one transgenic produces serum or eggs 
having an immunoglobulin heavy chain variable region, or a polypeptide comprising 

20 such, expressed therein. A second transgenic animal, having a second transgene, 
produces serum or eggs having an immunoglobulin light chain variable region, or a 
polypeptide comprising such, expressed therein. The polypeptides from two or more 
transgenic animals may be isolated from their respective sera and eggs and combined 
in vitro to generate a binding site capable of binding an antigen. 

25 One aspect of the present invention, therefore, concerns transgenic vertebrate 

animals such as transgenic birds, for example, transgenic chickens, comprising a 
recombinant nucleic acid molecule and which may (though optionally) expresses a 
heterologous gene in one or more cells in the animal. Suitable methods for the 
generation of transgenic animals are known in the art and are described in, for 

30 example, WO 99/19472, pubUshed April 22, 1999; WO 00/1 1151, published March 2, 
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2000; and WO 00/56932, published September 28, 2000, the disclosures of which are 
incorporated herein by reference in their entirety. 

Embodiments of the melhods for the production of a heterologous polypeptide 
by avian tissue such as oviduct tissue and the production of eggs which contain 
5 heterologous protein involve providing a suitable vector and introducing the vector 
into embryonic blastodermal cells together with an integrase, for example, a serine 
recombinase such as phiC3 1 integrase, so that the vector can integrate into the avian 
genome. A subsequent step involves deriving a mature transgenic avian from the 
transgenic blastodermal cells produced in the previous steps. Deriving a mature 
10 transgenic avian from the blastodermal cells optionally involves transferring the 
transgenic blastodermal cells to an embryo and allowing that embryo to develop fully, 
so that the cells become incorporated into the bird as the embryo is allowed to 
develop. 

Another alternative may be to transfer a transfecled nucleus to an enucleated 
1 5 recipient cell which may then develop into a zygote and ultimately an adult bird. The 

resulting chick is then grown to maturity. 

In another embodiment, the cells of a blastodermal embryo are transfected or 

transduced with the vector and integrase directly within the embryo. It is 

contemplated, for example, that the recombinant nucleic acid molecules of the present 
20 invention may be introduced into a blastodermal embryo by direct microinjection of 

the DNA into a stage X or carher embryo that has been removed from the oviduct. 

The egg is then returned to the bird for egg white deposition, shell development and 

laying. The resulting embryo is allowed to develop and hatch, and the chick allowed 

to mature. 

25 In one embodiment, a transgenic bird of the present invention is produced by 

introducing into embryonic cells such as, for instance, isolated avian blastodermal 
cells, a nucleic acid construct comprising an attB recombination site capable of 
recombining with a pseudo-attP recombination site found wilhin the nuclear genome 
of the organism from which the cell was derived, and a nucleic acid fragment of 

30 interest, in a manner such that the nucleic acid fragment of interest is stably integrated 
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into the nuclear genome of germline cells of a mature bird and is inherited in normal 
Mendelian fashion. It is also within the scope of the invention that the targeted cells 
for receiving the transgene have been engineered to have a heterologous attP 
recombination site, or other recombination site, integrated into the nuclear genome of 
5 the cells, thereby increasing the efficiency of recognition and recombination with a 
heterologous attB site. 

In either case, the transgenic bird produced from the transgenic blastodermal 
ceUs is known as a "founder". Some founders can be chimeric or mosaic birds if, for 
example, microinjection does not dehver nucleic acid molecules to all of the 
10 blastodermal cells of an embryo. Some founders will carry the transgene in the tubular 
gland cells in the magnum of their oviducts and will express the heterologous protein 
encoded by the transgene in their oviducts. If the heterologous protein contains the 
appropriate signal sequences, it will be secreted into the lumen of the oviduct and onto 
the yolk of an egg. 

15 Some founders are germline founders. A germline founder is a founder that 

carries the transgene in genetic material of its germline tissue, and may also carry the 
transgene in oviduct magnum tubular gland cells that express the heterologous protein. 
Therefore, in accordance with the invention, the transgenic bird will have tubular 
gland cells expressing the heterologous protein and the offspring of the transgenic bird 

20 will also have oviduct magnum tubular gland cells that express the selected 
heterologous protein. (Alternatively, the offspring express a phenotype determined by 
expression of the exogenous gene in a specific tissue of the avian.) 

The stably modified oviduct cells will express the heterologous polynucleotide 
and deposit the resulting polypeptide into the egg white of a laid egg. For this 

25 purpose, the expression vector will further comprise an oviduct-specific promoter such 
as ovalbumin or ovomucoid operably linked to the desired heterologous 
polynucleotide. 

llie invention also relates to methods of screening for cells (e.g., avian cells) in 
which a nucleotide sequence has been inserted. The invention provides for the 
30 isolation of such cells by employing the expression of a marker coding sequence. 
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Cells that are contemplated for use as disclosed herein include, without limitation, 
germline cells which may include sperm cells, ova cells, and embryo cells. The 
embryos may be for example, stage I, stage II, stage HI, stage IV5 stage V, stage VI, 
stage VIT, stage Vlll, stage TX, stage X, stage XI or stage XTI embryos. In one 
5 particularly useful embodiment, the cells contemplated for use include blastodermal 
cells. 

In one embodiment, a first nucleotide sequence comprising a first 
recombination site, such as recombination sites disclosed elsewhere herein (e.g., an 
attP site), also includes a functional transcription initiation site. Any useful functional 

10 transcription initiation site may be employed. In one embodiment, a U3 promoter is 
employed. In one embodiment, a long terminal repeat (LTR) region of a retrovirus is 
employed as the transcription initiation site. For example, a L TR which includes a U3 
promoter may be employed. 

Examples of other useful transcription initiation sites may include, without 

15 limitation, Pol III promoters (including type 1, type 2 and type 3 Pol III promoters) 
such as III promoters, U6 promoters, tRNA promoters, RNase MPR promoters and 
functional portions of each of these promoters. Other promoters that may be useful in 
the present invention include, without limitation, Pol I promoters, Pol U promoters, 
cytomegalovirus (CMV) promoters, rous-sarcoma virus (RSV) promoters, avian 

20 leukemia virus (ALV) promoters, actin promoters such as beta actin promoters, murine 
leukemia virus (MLV) promoters, mouse mammary tumor virus (MMTV) promoters, 
SV40 promoters, ovalbumin promoters, lysozyme promoters, conalbumin promoters, 
ovomucoid promoters, ovomucin promoters, ovotransferrin promoters and functional 
portions of each of these promoters. 

25 In accordance with the present methods, the first nucleotide sequence 

comprising the first recombination site and transcription initiation site is inserted into 
a genome of a cell by any useful method. For example, the first nucleotide sequence 
may be inserted into the genome as part of a retrovirus construct (e.g., ALV). For 
example, a retrovirus comprising an attP site may be transduced into the genome of the 

30 cell (Fig. 26). 
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The invention provides for the introduction of a second nucleotide sequence, 
which includes a second recombination site such as recombination sites disclosed 
elsewhere herein (e.g., an attB site) a nucleotide sequence of interest (denote as 
"transgene" in Fig. 26) and a promoterless marker coding sequence, into one or more 
5 cells which include the first nucleotide sequence in their genome. 

Any useful method for the introduction of the nucleotide sequences into the 
cells is contemplated for use herein. Exemplary deUvery systems for the nucleic acids 
include, without limitation, liposomal derived systems, poly-lysine conjugates, 
protoplast fusion, microinjection and electroporation. 

10 Any useful marker coding sequence may be employed in the present screening 

methods. For example, a bioluminescent protein coding sequence may serve as the 
marker coding sequence for use as disclosed herein. In one embodiment, the present 
invention contemplates the use of a green fluorescent protein (GFP) marker gene 
coding sequence. In one embodiment, antibiotic resistance is the marker. 

15 In one embodiment, the marker coding sequence is positioned such that when 

integration occurs between the first and second recombination sites, the marker 
expression will be under the control of the transcription initiation site of the first 
nucleotide sequence and will be expressed. Cells in which integration has occurred 
can be identified by expression of the marker coding sequence. 

20 The present invention provides for the isolation of one or more cells in which 

the marker coding sequence is expressed. In the case of bioluminescent markers such 
as GFP, the cells may be sorted and thereafter isolated using flow cytometry by 
methods well known in the art such as those methods disclosed in de Jong et al. 
Cytometry 35: 129-133 (1999) and Griffin et al. Cytogenet. Cell Genet. 87: 278-281 

25 (1999). Any useful methods of cell separation or isolation ai-e contemplated for use 
herein including mechanical isolation or the use of laser scissors and tweezers, and the 
like. 

In one useful embodiment, the second nucleotide sequence is introduced into 
blastodermal cells which include the first nucleotide sequence in their genome. For 
30 example, the blastodermal cells may comprise avian blastodermal cells isolated from 
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fertile embryos, such as stage VII to stage XII embryos. BlastodeTinal cells in which 
the marker coding sequence is expressed are isolated and introduced into the 
subgcrminai cavity of fertile eggs. Suitable methods for the manipulation of avian 
eggs, including opening and resealing hard shell eggs are described in U.S. Patent 
5 Serial Nos. 5,897,998 and 6,397,777 the disclosures of which are incorporated herein 
by reference in their entireties. The eggs are hatched and the chicks raised to maturity 
by methods well known in the field. 

This description uses gene nomenclature accepted by the ('ucurbit Genetics 
Cooperative as it appears in the Cucurbit Genetics Cooperative Report 18:85 (1995), 
1 0 which are incorporated herein by reference in its entirety. 

The disclosures of publications such as journal articles, patents, and pubUshed 
patent applications referred to in this application are hereby incorporated by reference 
in their entirety into the present appUcation. 

It will be apparent to those skilled in the art that various modifications, 
15 combinations, additions, deletions and variations can be made in the present invention 
without departing from the scope or spirit of the invention. For instance, features 
illustrated or described as part of one embodiment can be used in another embodiment 
to yield a still further embodiment. It is intended that the present invention covers 
such modifications, combinations, additions, deletions and variations as come within 
20 the scope of the appended claims and their equivalents. 

The present invention is further illustrated by the following examples, which 
are provided by way of illustration and should not be construed as limiting. The 
contents of all references, published patents and patents cited throughout the present 
application are hereby incorporated by reference in their entireties. 

25 

Example 1 : Phage phiC31 Integrase Functions in Avian Cells 

(a) A luciferase vector bearing either an attB (SEQ ID NO: 2 shown in Fig. 10) or attP 
(SEQ ID NO: 3 shown in Fig. 1 1) site was cotransfected with an integrase expression 
vector CMV-C31int (SEQ ID NO: 1) into DF-I cells, a chicken fibroblast cell Une. 
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The cells were passaged several times and the luciferase levels were assayed at each 
passage. 

Cells were passaged every 3-4 days and one third of the cells were harvested 
and assayed for luciferase. The expression of luciferase was plotted as a percentage of 
5 the expression measured 4 days after transfection. A luciferase expression vector 
bearing an attP site as a control was also included. 

As can be seen in Fig. 2, in the absence of integrase, luciferase expression from 
a vector bearing attP or atlB decreased to very low levels after several days. However, 
luciferase levels were persistent when the luciferase vector bearing attB was 
10 cotransfected with the integrase expression vector, indicating that the luciferase vector 
had stably integrated into the avian genome. 

(b) A drug-resistance colony formation assay was used to quantitate integration 
efficiency. The puromycin resistance expression vector pCMV-pur was outfitted with 
an attB (SEQ ID NO: 4 shown in Fig. 12) or an attP (SEQ ID NO: 5 shown in Fig. 13) 

15 sites. Puromycin resistance vectors bearing attB sites were cotransfected with phiC31 
integrase or a control vector into DF-I cells. One day after transfection, puromycin 
was added. Puromycin resistant colonies were counted 12 days post-transfection. 

In the absence of cotransfected integrase expression, few DF-I cell colonies 
were observed after survival selection. When integrase was co-expressed, multiple 

20 DF-I cell colonies were observed, as shown in Fig. 3. vSimilar to the luciferase 
expression experiment, the attB sequence (but not the attP sequence) was able to 
facilitate integration of the plasmid into the genome. Fig. 3 also shows that phiC3 1 
integrase functions at both 37° Celsius and 41° Celsius. Integrase also functions in 
quail cells using the puromycin resistance assay, as shown in Fig. 4. 

25 (c) The CMV-pur-attB vector (SEQ ID NO: 4) was also cotransfected with an 
enhanced green fluorescent protein (EGFP) expression vector bearing an atlB site 
(SEQ ID NO: 6 shown in Fig. 14) into DF-I cells and the phiC31 integrase expression 
vector CMV-C31int (SEQ ID NO: 1). After puromycin selection for 12 days, the 
colonies were viewed with UV light to determine the percentage of cells that 

30 expressed EGFP. Approximately 20% of puromycin resistant colonies expressed 
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EGFP in all of the cells of the colony, as shown in Fig. 5, indicating that the integrase 
can mediate multiple integrations per cell. 

(d) PhiC31 integrase promoted the integration of large transgenes into avian cells. A 
puromycin expression cassette comprising a CMV promoter, puromycin resistance 
5 gene, polyadenylation sequence and the attB sequence was inserted into a vector 
containing a 12.0 kb lysozynie promoter and the human interferon a2b gene (SEQ ID 
NO: 7 shown in Fig. 15) and into a vector containing a 10.0 kb ovomucoid promoter 
and the human interferon oGb gene (SFQ ID NO: 8) as shown in Fig. 16. 

DF-I cells were transfected with donor plasmids of varying lengths bearing a 
10 puromycin resistance gene and an attB sequence in the absence or presence of an 
integrase expression plasmid. Puromycin was added to the culture media to kill those 
cells which did not contain a stably integrated copy of the puromycin resistance gene. 
Cells with an integrated gene formed colonies in the presence of puromycin in 7-12 
days. The colonies were visualized by staining with methylene blue and the entire 60 
1 5 mm culture dish was imaged. 

PhiCS 1 integrase mediated the efficient integration of both vectors as shown in 

Fig. 7. 

Example 2 ; Cell Culture Methods 

20 DF-I cells were cultured in DMEM with high glucose, 10% fetal bovine 

serum, 2 mM L-glutamine, 100 units/ml penicillin and 100 Jig/ml streptomycin at 37° 
C'elsius and 5% CXh- A separate population of DF-1 cells was grown at 41° (Celsius. 
These cells were adapted to the higher temperature for one week before they were used 
for experiments. 

25 Quail QT6 cells were cultured in RO medium (Gibco) with 5% newborn calf 

scrum, 1% chicken scrum heat inactivated (at 55" Celsius for 45 mins), 10 units/ ml 
penicillin and 10 (Xg/ml streptomycin at 37" Celsius and 5% COj- 
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Example 3 : Selection and Assay Methods 

(a) Puromycin selection assay: About 0.8 x 10^ DF-1 (chicken) or QT6 (quail) cells 
were plated in 60 mm dishes. The next day, the cells were transfected as follows: 

10 to 50 ng of a donor plasmid and 1 to 10 \lg of an Integrase-expressing 
5 plasmid DNA were mixed with 150 Jll of OptiMEM. 15 fJ.1 of DMRIE-C was mixed 
with 150 |al of OptiMEM in a separate tube, and the mixtures combined and incubated 
for 15 mins. at room temperature. 

While the liposome/DNA complexes were forming, the cells were washed with 
OptiMEM and 2.5 ml of OptiMEM was added. After 15 minutes, 300 ^tl of the DNA- 
1 0 lipid mixture was added drop wise to the 2.5 ml of OptiMEM covering the cell layers. 
The cells were incubated for 4-5 hours at either 37° Celsius or 41° Celsius, 5% CO2. 
The transfection mix was replaced with 3 mis of culture media. The next day, 
puromycin was added to the media at a final concentration of 1 JXg/ml, and the media 
replaced every 2 to 4 days. Puromycin resistant colonies were counted or imaged 10- 
15 12 days after the addition of puromycin. 

(b) Luciferase assay: Chicken DF-I or quail Qr6 cells (0.8 x lO*^") were plated in 60 
mm dishes. Cells were transfected as described above. The cells from a plate were 
transferred to a new 100 mm plate when the plate became confluent, typically on day 
3-4, and re-passaged every 3-4 days. 

20 At each time point, one-third of the cells from a plate were replated, and one- 

third were harvested for the luciferase assay. The cells were pelleted in an eppendorf 
tube and frozen at -70"C . 

The cell pellet was lysed in 200 [i\ of lysis buffer (25 mM Tris-acetate, pII7.8, 
2mM EDTA, 0.5% Triton X-100, 5% glycerol). Sample (5^1) was assayed using the 

25 Promega BrightGlo reagent system. 

(c) Visualization of EGFP: EGFP expression was visualized with an inverted 
microscope with FITC illumination [Olympus 1X70, 100 W mercury lamp, IIQ-FITC 
Band Pass Emission filter cube, exciter 480/40 nm, emission 535/50 nm, 20X phase 
contrast objective (total magnification was 2.5 x 10 x 20)]. 
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(d) Staining of cell colonies: After colonies had formed, typically after 7-12 days of 
culture in puromycin medium, the cells were fixed in 2% formaldehyde, 0.2% 
glutaraldehyde for 15 mins, and stained in 0.2% methylene blue for 30 mins. followed 
by several washes with water. The plates were imaged using a standard CCD camera 

5 in visible light. 

Example 4 ; Production of Genetically Transformed Avian Cells 

Avian stage X blastodcrmal cells arc used as the cellular vector for the 
transgenes. Stage X embryos are collected and the cells dispersed and mixed with 

10 plasmid DNA. The transgenes are then introduced to blastodcrmal cells via 
electroporation. The cells are immediately injected back into recipient embryos. 

The cells are not cultured for any time period to ensure that they remain 
capable of contributing to the germline of resulting chimeric embryos. However, 
because there is no culture step, cells that bear the transgene cannot be identified. 

15 Typically, only a small percentage of cells introduced to an embryo will bear a stably 
integrated transgene (0.01 to 1%). To increase the percentage of cells bearing a 
transgene, therefore, the transgene vector bears an attB site and is co-clcctroporated 
with a vector bearing the CMV promoter driving expression of the phiC3 1 transgene 
(CMV-C31int (SEQ ID NO: 1). The inlegrase then drives integration of the transgene 

20 vector into the nuclear genome of the avian cell and increases the percentage of cells 
bearing a stable transgene. 

(a) Preparation of avian stage X blastodermal cells: 

i) Collect fertilized eggs from Barred Rock or White leghorn chickens 
(Gallus gallus) or quail (Japonica cotumix) within 48 hrs. of laying; 
25 ii) Use 70% ethanol to clean the shells; 

iii) Crack the shells and open the eggs; 

iv) Remove egg whites by transferring yolks to opposite halves of shells, 
repeating to remove most of the egg whites; 

v) Put egg yolks with embryo discs facing up into a lOcmpetri dish; 

30 vi) Use an absorbent tissue to gently remove egg white from the embryo discs; 
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vii) Place a Whatman filter paper 1 ring over the embryos; 

viii) Use scissors to cut the membranes along the outside edge of the paper ring 
while gently lifting the ring/cmbryos with a pair of tweezers; 

ix) Insert the paper ring with the embryos at a 45 degree £ingle into a petri dish 
5 containing PBS-G solution at room temperature; 

x) After ten embryo discs are collected, gently wash the yolks from the 
blastoderm discs using a Pasteur pipette under a stereo microscope; 

xi) Cut the discs by a hair ring cutter (a short piece of human hair is bent into a 
small loop and fastened to the narrow end of a Pasteur pipette with 

10 Parafilm); 

xii) Transfer the discs to a 15 ml sterile centrifuge tube on ice; 

xiii) Place 10 to 15 embryos per tube and allow to settle to the bottom (about 5 
mins.); 

xiv) Aspirate the supernatant from the tube; 

15 xv) Add 5 mis of ice-cold PBS without Ca++ and Mg^+, and gently pipette 4 to 

5 times using a 5 mis pipette; 

xvi) Incubate in ice for 5-7 mins. to allow the blastoderms to settle, and aspirate 
the supernatant; 

xvii) Add 3 mis of ice cold 0.05% trypsin/0.02% ETDA to each tube and gently 
20 pipette 3 to 5 times using a 5 ml pipette; 

xviii) Put the tube in ice for 5 mins. and then flick the tube by finger 40 times. 
Repeat; 

xix) Add 0.5 mis FBS and 3-5 mis BDC medium to each tube and gently pipette 
5-7 times using a 5 ml pipette; 

25 xx) Spin at 500 rpm ( RCF 57 x g ) at 4° Celsius for 5 mins; 

xxi) Remove the supernatant and add 2 mis ice cold BDC medium into each 
tube; and 

xxii) Resuspend the cells by gently pipetting 20-25 times; and 

xxiii) Determine the ceU titer by hemacytometer and ensure that about 95% of all 
30 BDCs are single cells, and not clumped. 
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(b) Transfection of linearized plasmids into blastodermal cells by small scale 
electroporation: 

i) Centrifuge the blastodermal cell suspension from step (xxiii) above at RCF 
57 X g, 4° Celsius, for 5 mins; 
5 ii) Resuspend cells to a density of 1-3 x lO^per ml with PBS without Ca?+ and 

Mg2+; 

iii) Add linearized DNA, 1-30 |J.g per 1-3 x 10^ blastodermal cells in an 
eppendorf tube at room temperature. Add equimolar molar amounts of the 
non-linearized transgene plasmid bearing an attB site, and an integrase 

10 expression plasmid; 

iv) Incubate at room temperature for 10 mins; 

v) Aliquot 100 jil of the DNA-ceU mixture to a 0.1 cm cuvette at room 
temperature; 

vi) Electroporate at 240 V and 25 |iFD (or 100 V and 125 ^tFD for quail cells) 
15 using, for example, a Gene Pulser II™ (BIO-RAD). 

vii) Incubate the cuvette at room lemperalure for 1-10 mins. 

viii) Before the electroporatcd cells are injected into a recipient embryo, they are 
transferred to a eppendorf tube at room temperature. The cuvette is washed 
with 350 [\1 of media, which is transferred to the eppendorf, spun at room 

20 temperature and re-suspended in 0.01-0.3 ml medium; 

ix) Inject 1-10 jil of cell suspension into the subgerminal cavity of an non- 
irradiated or, for example, an irradiated (e.g., with 300-900 rads) stage X 
egg. vShell and shell membrane are removed and, after injection, resealed 
according to U.S. Patent No. 6,397,777, issued June 6, 2002, the disclosure 

25 of which is incorporated herein by reference in its entirety; and 

x) The egg is then incubated to hatching. 

(c) Blastodermal Cell Culture Medium: 

i) 409.5 mis DMEM with high glucose, L-glutamine, sodium pyruvate, 
pyridoxine hydrochloride; 
30 ii) 5 mis Men non-essential amino acids solution, 10 mM; 
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iii) 



5 mis Penicillin-streptomycin 5000 U/ml each; 
5 mis L-glutamine, 200 mM; 



iv) 



V) 



75 mis fetal bovine serum; and 



vi) 



0.5 mis p-mercaptoethanol, 11.2mM. 



5 



Example 5 ; Transfection of Stage X Embryos with attB Plasmids 

(a) DNA-PEI: Twenty-five |j.g of a phage phiC31 integrase expression plasmid 
(pCMV-int), and 25 jxg of a luciferasc-cxpressing plasmid (pP-actin-GFP-attB) arc 
combined in 200 \}1 of 28 mM Hepes (pH 7.4). The DNA/Hepes is mixed with an 

10 equal volume of PEI which has been diluted 10-fold with water. The DNA/Hepes/PEI 
is incubated at room temperature for 15 mins Three to seven [il of the complex are 
injected into the subgerminal cavity of windowed stage X white leghorn eggs which 
are then sealed and incubated as described in U.S. Patents No. 6,397,777, issued .Tune 
6, 2002. The complexes will also be incubated with blastodermal cells isolated from 

15 stage X embryos which are subsequently injected into the subgerminal cavity of 
windowed iiradiated stage X white leghom eggs. Injected eggs are sealed and 
incubated as described above. 

(b) Adenovirus-PEI: 

Two [ig of a phage phiC3 1 integrase expression plasmid (pCMV-int), 2 jxg of a GFP 
20 expressing plasmid (pP-actin-GFP-attB) and 2 jxg of a luciferase expressing plasmid 
(pGLB) were incubated with 1.2 |il of JetPEFi^ in 50 |ll of 20 mM Hepes buffer 
(pH7.4). After 10 mins at 250C, 3 x lO^ adenovirus particles (Ad5-Null, Qbiogene) 
were added and the incubation continued for an additional 10 mins. Embryos are 
transfected in ovo or ex ovo as described above. 



Example 6 : Stage I Cytoplasmic Iniection 

Production of transgenic chickens by cytoplasmic DNA injection using DNA 
injection directly into the germinal disk as described in Sang et al, MoL Reprod. Dev., 
1: 98-106 (1989); Love et al, Biotechnology, 12: 60-63 (1994) incorporated herein by 
30 reference in their entireties. 



25 
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In the method of the present invention, fertilized ova, or stage I embryos, are 
isolated from euthanized hens 45 mins. to 4 hrs. after oviposition of the previous egg. 
Alternatively, eggs were isolated from hens whose oviducts have been fistulated 
according to the techniques of Gilbert & Wood-Gush, J. Reprod. Fertil., 5: 451-453 
5 (1963) and Pancer et al, Br. Poult. ScT., 30: 953-7 (1989) incorporated herein in their 
entireties. 

An isolated ovum was placed in dish with the germinal disk upwards. Ringer's 
buffer medium was then added to prevent drying of the ovum. Any suitable 
microinjection assembly and methods for microinjecting and reimplanting avian eggs 

10 are useful in the method of cytoplasmic injection of the present invention. A 
particularly suitable apparatus and method for use in the present invention is described 
in U.S. Patent AppUcation Serial No: 09/919,143, published July 31, 2001, the 
disclosure of which is incorporated in its entirety herein by reference. The avian 
microinjection system described in the '143 Application allowed the loading of aDNA 

15 solution into a micropipctLe, followed by prompt positioning of the germinal disk 
under the microscope and guided injection of the DNA solution into the germinal disk. 
Injected embryos could then be surgically transferred to a recipient hen as described, 
for example, in Olsen & Neher, J. Exp. ZooL, 109: 355-66 (1948) and Tanaka et al, J. 
Reprod. Fertil., 100: 447-449 (1994). The embryo was allowed to proceed through the 

20 natural in vivo cycle of albumin deposition and hard-sheU formation. The transgenic 
embryo is then laid as a hard-sheU egg which was incubated until hatching of the 
chick. Injected embryos were surgically transferred to recipient hens via the ovum 
transfer method of Christmann et al in PCT/USOl/26723, published August 27, 2001, 
the disclosure of which is incorporated herein by reference in its entirety, and hard 

25 shell eggs were incubated and hatched. 

Approximately 25 nl of DNA solution (about 60ng/|ll) with either integrase 
mRNA or protein were injected into a germinal disc of stage I White Leghorn embryos 
obtained 90 minutes after oviposition of the preceding egg. Typically the concentration 
of integrase mRNA used was 100 ng/jxl, and the concentration of integrase protein was 

30 66ng/(al. 
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To synthesize the integrase mRNA, a plasmid template encoding the integrase 
protein was linearized at the 3' end of the transcription unit. mRNA was synthesized, 
capped and a polyadenine tract added using the mMESSAGE mMACHINE T7 Ultra 
Kit^M (Ambion, Austin, TX). The mRNA was purified by extraction with phenol and 
5 chloroform and precipitated with isopropanol. The integrase protein was expressed in 
E. coli and purified as described by Thoipe et al, Mol. Microbiol., 38: 232-241 (2000). 

A plasmid encoding for the integrase protein is transfected into the target cells. 
However, since the early avian embryo transcriptionally silent until it reaches about 
22,000 cells, injection of the integrase mRNA or protein was expected to result in 
10 better rates of transgenesis, as shown in the Table 1 below. 

The chicks produced by this procedure were screened for the presence of the 
injected transgene using a high throughput PCR-based screening procedure as 
described in Harvey et al. Nature Biotech., 20: 396-399 (2002). 



15 Table 1: Summary of cytoplasmic injection results using dirferenl integrase strategics 



Experimental 
group 


Ovum 
transfers 


Hard shells 
produced (%) 


Chicks 
hatched (%) * 


Transgenic 
chicks (%) * 


No Integrase 


5164 


3634 (70%) 


500 (14%) 


58 (11.6%) 


Integrase 
mRNA 


1109 


833 (75%) 


115(13.8%) 


19 (16.5%) 


Integrase 
protein 


374 


264 (70.6%) 


47(17.8%) 


16 (34%) 



t . 



Percentages based on the number of hard shells 
Percentages based on the number of hatched birds 



Example 7 ; Characterization of phiC31 Integrase-Mediated Integration Sites in 
20 the Chicken Genome 

To characterize phiC3 1 -mediated integration into the chicken genome, a 
plasmid rescue method was used to isolate integrated plasmids from transfected and 
selected chicken fibroblasts. Plasmid pCR-XL-TOPO-CMV-pur-attB (SEQ ID NO: 
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10, shown in Fig. 18) does not have BamH I or Bgl II restriction sites. Genomic DNA 
from ceUs transformed with pCR-XL-TOPO-CMV-pur-attB was cut with BamH I or 
Bgl II (cither or both of which would cut in the flanking genomic regions) and 
religated so that the genomic DNA surrounding the integrated plasmid would be 
5 captured into the circularized plasmid. The flanking DNA of a number of plasmids 
were then sequenced. 

DF-1 cells (chicken fibroblasts), 4 x 10^ were transfected with 50 ng of pCR- 
XL- lOPO-CMV-pur-attB and 1 jig of pCMV-int. The following day, the culture 
medium was replaced with fresh media supplemented with 1 jxg/ml puromycin. After 

10 10 days of selection, several hundred puromycin-resistant colonies were evident. 
These were harvested by trypsinzation, pooled, replated on 10 cm plates and grown to 
confluence. DNA was then extracted. 

Isolated DNA was digested with BamH I and Bgl IT for 2-3 hrs, extracted with 
phcnol:chloroform:isoamyl alcohol chloroformasoamyl alcohol and clhanol 

15 precipitated. T4 DNA ligase was added and the reaction incubated for 1 hr at room 
temperature, extracted with phenohchloroformdsoamyl alcohol and 
chloroform:isoamyl alcohol, and precipitated with cthanol. 5 |J-1 of the DNA 
suspended in lOjil of water was electroporated into 25 jll of Genehogs''''^ (Invitrogen) 
in an 0.1 cm cuvette using a GenePulser II (Biorad) set at 1.6 kV, 100 ohms, 25 uF and 

20 plated on Luria Broth (LB) plates with 5 jJ.g/ml phleomycin (or 25 jJ.g/ml zeocin) and 
20 jJ.g/ml kanamycin. Approximately 100 individual colonies were cultured, the 
plasmids extracted by standard miniprep techniques and digested with Xba I to 
identify clones with unique restriction fragments. 

Thirty two plasmids were sequenced with the primer attB-for (5'- 

25 TACCGTCGACGATGTAGGTCACGGTC-3') (SEQ ID NO: 12) which allows 
sequencing across the crossover site of attB and into the flanking genomic sequence. 
AU of plasmids sequenced had novel sequences inserted into the crossover site of attB, 
indicating that the clones were derived from plasmid that had integrated into the 
chicken genome via phiC3 1 integrase-mediated recombination. 
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The sequences were compared with sequences at GenBank using Basic Local 
Alignment Search Tool (BLAST). Most of the clones harbored sequences 
homologous to Ciallus genomic sequences in the TRAC^E database. 

5 Example 8 : Insertion of a Wild-Type attP Site into the Avian Genome Augments 
Integrase-Mediated Integration and Transgenesis 

The chicken B-cell line DT40 cells (Buerstedde et al (1990) E.M.B.O. J., 9: 
921-927) are useful for studying DNA integration and recombination processes 
(Buerstedde & Takeda (1991) Cell, 67:179-88). DT40 cells were engineered to harbor 

10 a wild-type attP site isolated from the Streptomyces phage phiC31. Two independent 
cell lines were created by transfection of a linearized plasmid bearing an attP site 
linked to a CMV promoter driving the resistance gene to G418 (DT40-NLB-attP) or 
bearing an attP site linked to a CMV promoter driving the resistance gene for 
puromycin (DT40-pur-attP). The transfected cells were cultured in ihc presence of 

15 Ci418 or puromycin to enrich for cells bearing an attP sequence stably integrated into 
the genome. 

A super-coiled luciferase vector bearing an attB (SEQ ID NO: 2 shown in Fig. 
10) was cotransfected, together with an integrase expression vector CMV-C31int 
(SEQ ID NO: 1) or a control, non-integrase expressing vector (CMV-BL) into wild- 

20 type DT40 cells and the stably transformed lines DT40-NLB-attP and DT40-pur-attP. 

Cells were passaged at 5, 7 and 14 days post- transfection and about one third 
of the cells were harvested and assayed for luciferase. The expression of luciferase 
was plotted as a percentage of the expression measured 5 days after transfection. As 
can be seen in Fig. 21, in the absence of integrase. or in the presence of integrase but in 

25 the DT40 cells lacking an inserted wild-type attP site, luciferase expression from a 
vector bearing attB progressively decreased to very low levels. However, luciferase 
levels were persistent when the luciferase vector bearing attB was cotransfected with 
the integrase expression vector into the attP bearing cell lines DT40-NLB-attP and 
DT40-pur-attP. Inclusion of an attP sequence in the avian genome augments the level 
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of integration efficiency beyond that afforded by the utilization of endogenous pseudo- 
attP sites. 

Example 9: .Generation of attP Transgenic Cell Line and Birds Using an NLB 
5 Vector 

The NLB-attP retroviral vector is injected into stage X chicken embryos laid by 
pathogen-free hens. A small hole is drilled into the egg shell of a freshly laid egg, the 
sheU membrane is cut away and the embryo visualized by eye. With a drawn needle 
attached to a syringe, 1 to 10 ^tl of concentrated retrovirus, approximately 2.5 x lO^IlJ^ 

10 is injected into the subgerminal cavity of the embryo. The egg shell is resealed with a 
hot glue gun. Suitable methods for the manipulation of avian eggs, including opening 
and resealing hard shell eggs are described in U.S. Patent Serial Nos: 5,897,998, 
issued May 27, 1999 and 6,397,777, issued June 4, 2002, the disclosures of which are 
herein incorporated by reference in their entireties. 

15 Typically, 25% of embryos hatch 2 1 days later. Ihc chicks are raised Lo sexual 

maturity and semen samples are taken. Birds that have a significant level of the 
transgene in sperm DNA will be identified, typically by a PCR-based assay. Ten to 
25% of the hatched roosters will be able to give rise to Gl transgenic offspring, 1 to 
20% of which may be transgenic. DNA extracted from the blood of Gl offspring is 

20 analyzed by PGR and Southern analysis to confirm the presence of the intact 
transgene. Several lines of transgenic roosters, each with a unique site of attP 
integration, are then bred to non-transgenic hens, giving 50% of G2 transgetiic 
offspring. Transgenic G2 hens and roosters from the same line can be bred to produce 
G3 offspring homozygous for the transgene. Homozygous offspring will be 

25 distinguished from hemizygous offspring by quantitative PGR. The same procedure 
can be used to integrate an attB or attP site into transgenic birds. 

Example 10; Expression of Immunoglobulin Chain Polypeptides by Transgenic 

Chickens 

30 Bacterial artificial chromosomes (BACs) contaitiing a 70 kb segment of the 
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chicken ovomucoid gene with the hght and heavy chain cDNAs for a human 
monoclonal antibody inserted along with an internal ribosome entry site into the 3' 
untranslated region of the ovomucoid gene were equipped with the attB sequence. The 
heavy and light chain cDNAs were inserted into separate ovomucoid BACs such that 
5 expression of an intact monoclonal antibody requires the presence of both BA(^s in the 
nucleus. 

Several hens produced by coinjection of the attB-bearing ovomucoid BACs 
and integrase-encoding mRNA into stage I embryos produced intact monoclonal 
antibodies in their egg white. One hen, which had a high level of the light chain 
10 ovomucoid BAC in her blood DNA as determined by quantitative PGR particularly 
expressed the light chain portion of the monoclonal antibody in the egg white at a 
concentration of 350 nanograms per ml, or approximately 12 ji,g per egg. 

Example 1 1 ; Stage T Cytoplasmic Tnjection with Tntegrase Activity and PET 

15 Produclion of transgenic chickens by cyloplasmic DNA injection directly into 

the germinal disk was done as described in Example 6. 

DNA (about 60ng/|al) which includes a transgene was placed in approximately 

25 nl of aqueous solution with integrase mRNA or integrase protein and was mixed 

with an equal volume of PEI that had been diluted ten fold. The mixture was injected 
20 into a germinal disc of stage I White Leghorn embryos obtained about 90 minutes after 

oviposition of the preceding egg. Typically the concentration of integrase mRNA used 

was about 100 ng/|al, and the concentration of integrase protein was about 66 ng/jil. 

The integrase mRNA was synthesized according to Example 6. 

Transgenic chicks produced by this procedure using: integrase mRNA/PEI and 
25 integrase protein/PEI showed positive results for the presence of heterologously 

expressed protein in the blood, semen and egg white. 

Example 12; Stage I Cytoplasmic Injection with Integrase Activity and NLS 

Production of transgenic chickens by cytoplasmic DNA injection directly into 
30 the germinal disk was done as described in Example 6. 
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DNA which includes a transgene was suspended in 0.25 M KCl and vSV40 T 
antigen nuclear localization signal peptide (NLS peptide, amino acid sequence 
CGGPKKKRKVG (SEQ ID NO: 13)) was added to achieve a peptide DNA molar 
ratio of 100:1. The DNA (about 60ng/|al) was allowed to associate with the SV40 T 
5 antigen NLS peptide by incubating at 25 degrees C for about 15 minutes. 

Integrase mRNA or integrase protein was added to approximately 25 nl of an 
aqueous DNA/NLS solution, typically, to produce a final concentration of integrase 
mRNA of about 50 ng/|Jl, or an integrase protein concentration of about 33 ng/fil. The 
mixture was injected into a germinal disc of stage I White Leghorn embryos obtained 
10 about 90 minutes after oviposition of the preceding egg. 'ITie integrase mRNA was 
synthesized as according to Example 6. 

Transgenic chicks produced by this procedure using: integrase mRNA/NLS 
and integrase protein/NLS showed positive results for the presence of heterologously 
expressed protein in blood, semen and egg white. 

15 

Example 13: Dispersing of Plasmid DNA in Avian Stage I Embryos 

DNA samples are Cy3 labeled with a Cy3 ULS labeling kit (Amersham 
Pharmacia Biotech). Briefly, plasmid DNA ( 1 jxg) was sheared to approximately 100 
to 500 bp fragments by sonication. Resulting DNA was incubated at 65 "C for 15 min 

20 in Cy3 ULS labeling solution and unincorporated Cy3 dye was removed by spin 
column chromatography (CentriSep, Princeton Separations). The distribution of the 
DNA in stage I avian embryos was visualized after introduction into the stage I avian 
embryo. Enough high molecular weight or low molecular weight PEI was added to the 
DNA to coat the DNA. Typically, PEI was added to the DNA to a concentration of 

25 about 5%. Any useful volume of DNA/PEI can be used, for example about 25 nl. 

Figure 22 shows an avian stage one embryo containing Cy3 labeled naked 
DNA. In Figure 22 it can be seen that the DNA is localized to certain areas of the 
embryo. Figure 23 and Figure 24 show an avian stage one embryo containing Cy3 
labeled DNA coated with low molecular (22 kD) weight PEI (Figure 23) and high 
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molecular weight (25 kD) PEI (Figure 24). In Figures 23 and 24, it can be seen that 
the DNA is dispersed throughout the embryos. 

These experiments show that DNA/PEI conjugates are distributed more 
uniformly in the cytoplasm of injected embryos when compared with naked DNA. 

5 

Example 14; Production of an attP Transgenic Chicken 

GO transgenic chickens have been produced as described in Example 9. 
Several hundred stage X White Leghorn eggs were injected with the NLB-attP vector 
and about 50 chicks hatched. Sperm from approximately 30% of the hatched roosters 
10 has been shown to be positive for the attP site. These hemizygotic chickens are used 
to generate transgenic G2 chickens homozygotic for the attP site. 

Example 15; Cytoplasmic Injection of attP Stage I Embryos with OMC24-attB- 

1RES-CTLA4 

15 Transgenic chickens are produced by cytoplasmic DNA injection directly into 

the germinal disk of eggs laid by transgenic homozygous attP chickens and fertilized 
with sperm from the same line of homozygous attP roosters, the line produced as 
described in Example 14. The cytoplasmic injections are carried out as described in 
U.S. Patent Application Serial No. 09/919,143, filed July 31, 2001, ('143 Application) 

20 and U.S. Patent Apphcation Serial No. 10/251,364, filed September 18, 2002. The 
disclosures of each of these two patent apphcations are incorporated herein by 
reference in their entirety. 

Stage I embryos are isolated 45 mins. to 4 hrs. after oviposition of the previous 
egg. An isolated embryo is placed in a dish with the germinal disk upwards. Ringer's 

25 buffer medium is added to prevent drying of the ovum. The avian microinjection 
system described in the '143 Application allows for the loading of DNA solution into 
a micropipette, followed by prompt positioning of the germinal disk under the 
microscope and guided injection of the DNA solution into the germinal disk. 

Approximately 25 nl of a DNA solution (about 60ng/jil) of the 77 kb OMC24- 

30 attB-IRES-CTLA4, disclosed in US Patent Apphcation No. 10/856,21 8, filed May 28, 
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2004, the disclosure of which is incorporated in its entirety herein by reference, with 
either integrase mRNA or protein are injected into a germinal disc of the isolated stage 
I embryos. Typically, the concentration of integrase mRNA used is 100 ng/|il or the 
concentration of integrase protein is 66 ng/ (il . 
5 To synthesize the integrase mRNA, a plasmid template encoding the integrase 

protein is linearized at the 3' end of the transcription unit. mRNA is synthesized, 
capped and a polyadenine tract added using the mMESSAGE mMACHlNE T7 Ultra 
KitTM (Ambion, Austin, TX). The mRNA is purified by extraction with phenol and 
chloroform and precipitated with isopropanol. The integrase protein is expressed in E . 

10 coh and purified as described by Thorpe et al, Mol. Microbiol., 38: 232-241 (2000). 

Injected embryos are surgically transferred to a recipient hen as described in 
Olsen & Neher, J. Exp. Zool., 109: 355-66 (1948) and Tanaka et al, J. Reprod. Fertil, 
100: 447-449 (1994). The embryo is allowed to proceed through the natural in vivo 
cycle of albumin deposition and hard-shell formation. The transgenic embryo is then 

15 laid as a hard-shell egg which is incubated until hatching of the chick. Injected 
embryos are surgically transferred to recipient hens via the ovum transfer method of 
Christmann et al in PCT/USOl/26723, published August 27, 2001, the disclosure of 
which is incorporated by reference in its entirety, and hard shell eggs are incubated and 
hatched. 

20 The chicks produced by this procedure are screened for the presence of the 

injected transgene using a high throughput PCR-based screening procedure as 
described in Harvey et al. Nature Biotech., 20: 396-399 (2002). Approximately 20% 
of the chicks are positive for the transgene. Eggs from each of the mature hens 

carrying the transgene are positive for CTLA4. 

25 

Example 16: Cytoplasmic Injection of attP Stage I Chicken Embryos with QMIQ- 

attB-CTLA4 

Transgenic chickens are produced by cytoplasmic DNA injection directly into 
the germinal disk of eggs laid by transgenic homozygous attP chickens and fertilized 
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with sperm from the same Une of homozygous attP roosters essentially as described in 
Example 15. 

Approximately 25 nl of a 60ng/(il DNA solution of the OMC24-attB-lRt;S- 
CTLA4 construct of Example 15 with the OMC24 70 kb ovomucoid gene expression 
5 controlling region and IRES of the construct replaced with the 10 kb ovomucoid gene 
expression controlling region of pBS-OVMUP-10, also disclosed in US Patent 
Application No. 10/856,218, filed May 28, 2004, is injected into a fertilized germinal 
disc of stage I embryos along with and integrase protein. The concentration of 
integrase protein used is 66 ng/(il. 
10 hijected embryos are then surgically transferred to a recipient hen, hard shell 

eggs are produced, incubated and hatched. Approximately 30% of the chicks are 
positive for the transgcnc. Eggs from each of the mature hens carrying the transgcne 
are positive for CTEA4. 

15 Example 17: Production of attP Transgenic Quail Using an NLB vector 

The NLB-atlP retroviral vector is injected into stage X quail embryos laid by 
pathogen-free quail. A small hole is drilled into the egg sheU of a freshly laid egg, the 
sheU membrane cut away and the embryo visualized by eye. With a drawn needle 
attached to a syringe, 1 to 10 |al of concentrated retrovirus, approximately 1.0 x 10^ lU, 
20 is injected into the subgerminal cavity of the embryo. The egg shell is resealed with a 
hot glue gun. 

Typically, 25% of embryos hatch. The chicks are raised to sexual maturity and 
semen samples are taken. Birds that have a significant level of the transgene in their 
sperm DNA will be identified, typically by a PCR-based assay. Of the hatched GO 

25 male quail, about 1% to about 20% are transgenic. The transgenic GO quail are bred to 
nontransgenic quail to produce hemizygotic Gl offspring. DNA extracted from the 
blood of Gl offspring is analyzed by PCR and Southern analysis to confirm the 
presence of the intact transgene. Several lines of hemizygotic transgenic male quail, 
each with a unique site of attP integration, are then bred to non-transgenic quail giving 

30 G2 offspring, 50% of which are transgenic. Transgenic G2 male and female from the 
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same line are then bred to produce G3 offspring homozygous for the treinsgene. 
Homozygous offspring are distinguished from hemizygous offspring by quantitative 
PGR. 

5 Example 18: Cytoplasmic Injection of attP Stage I Quail Embryos with OMC24- 

attB-lRES-G-CSF 

Transgenic quail are produced by cytoplasmic DNA injection directly into the 
germinal disk of eggs laid by fuUy transgenic homozygous attP quail produced as 
described in Example 17. The cytoplasmic injections are carried out essentially as 

10 described in the '143 Application and U.S. Patent Application Serial No. 10/251,364, 
filed September 18, 2002. 

Stage I embryos from homozygous attP quail fertilized with sperm from a 
homozygous attP quail are isolated approximately 90 minutes after oviposition of the 
previous egg. An isolated embryo is placed in a dish with the germinal disk upwards. 

15 Ringer's buffer medium is added to prcvcnl drying of the t)vum. The avian 
microinjcclion system described in the '143 Application is used to inject 
approximately 25 nl of a DNA solution (about 60ng/|J,l) of OMC24-attB-lRES- 
CrLA4, with the CTLA coding sequence replaced with the coding sequence for a 
human-granulocyte colony stimulating factor, and integrase protein into the germinal 

20 disc of the stage I quail embryos. The concentration of integrase protein used is 66 
ng/fH. 

Injected embryos are surgically transferred to a recipient quail essentially as 
described in Olsen & Neher, J. Exp. Zool, 109: 355-66 (1948) and Tanaka et al, J. 
Reprod. Fertil., 100: 447-449 (1994). The embryo is allowed to proceed through the 
25 natural in vivo cycle of albumin deposition and hard-shell formation. The transgenic 
embryo is then laid as a hard-shell egg which is incubated until hatching of the chick. 

The chicks produced by this procedure are screened for the presence of the 
injected transgene using a high throughput PCR-based screening procedure as 
described in Harvey et al, Nature Biotech., 20: 396-399 (2002). Approximately 20% 
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of the chicks are positive for the transgene. Eggs from each of the mature female quail 
carrying the transgene are positive for G-CSF. 

Example 19; Generation of attP Transgenic Duck Using an NLB vector 

5 The NI.B-attP retroviral vector is injected into stage X Duck embryos laid by 

pathogen-free Ducks. A small hole is drilled into the egg shell of a freshly laid egg, 
the shell membrane cut away and the embryo visualized by eye. About 1 to 10 |al of 
concentrated retrovirus, approximately 2.5 x 10^ lU, is injected into the subgerminal 
cavity of the embryo. The egg shell is resealed with a hot glue gun. 
10 Homozygous G3 offspring are obtained essentially as described in Example 17 

for quail. 

Example 20: Stage I Cytoplasmic Injection of attF Stage I Duck Embryos with 
QM24-attB-IRES-CTLA4 

15 Transgenic ducks are produced by cytoplasmic DNA injection directly inU) the 

germinal disk of eggs laid by homo/ygous altP ducks fertilized with sperm from 
homozygous allP ducks. The injection of the stage I embryos is carried out essentially 
as described in the '143 Application and U.S. Patent Application Serial No. 
10/251,364, filed September 18, 2002. Approximately 25 nl of a DNA solution (about 

20 60ng/|il) of OMC24-attB-IRES-CTLA4, with the CTLA4 coding region replaced with 
a coding sequence for human erythropoietin, and integrase encoding mRNA and 
protein is injected into the germinal disc of the stage I embryos. The concentration of 
integrase mRNA used is 100 ng/jol. The injected embryos are surgically transferred to 
a recipient duck and the embryo is allowed to proceed through the natural in vivo cycle 

25 of albumin deposition and hard-shell formation. The transgenic embryo is laid as a 
hard-shell egg which is incubated until hatching and the chicks are screened for the 
presence of the injected transgene. Approximately 20% of the chicks are positive for 
the transgene. Eggs from each of the mature female ducks carrying the transgene are 
positive for erythropoietin. 

30 
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Example 21; _ Production of Transchromosomie Chickens Using Satellite DNA- 
Based Artificial Chromosomes 

Satellite DNA-bascd artificial chromosomes (ACKs, as described in 
Lindenbaum et al Nucleic Acids Res (2004) vol 32 no. 21 el 72) were isolated by a 
5 dual laser high-speed flow cytometer as described previously (de Jong, G, el al. 
Cytometry 35: 129-133, 1999). 

The flow-sorted chromosomes were pelleted by centrifligation of a 750|H 
sample containing approximately 10^ chromosomes at 2500 x g for 30 min at 4^C. 
The supernatant, except the bottom 30 microliters (jil) containing the chromosomes, 
10 was removed resulting in a concentration of about 7000 to 11,500 chromosomes per |al 
of injection buffer (Monteith, et al. Methods MoT Biol 240: 227-242, 2004). 
Depending on the number of chromosomes to be injected, 25-100 nanoUters (nl) of 
injection buffer was injected per embryo. 

Embryos for this study were collected from 24-36 week-old hens from 
15 commercial White Leghorn variety of G. gallus. Embryo donor hens were inseminated 
weekly using pooled semen from roosters of the same breed to produce eggs for 
injection. 

On the day of egg collection, fertile hens were euthanized 2h post oviposition 
by cervical dislocation. Typically, oviposition is followed by ovulation of the next egg 
20 after about 24 minutes (Morris, Poultry Science 52: 423-445, 1973). The recently 
ovulated and fertilized eggs were collected from the upper magnum region of the 
oviduct under sterile conditions and placed in a glass weU and covered with Ringers' 
Medium (Tanaka, et al. J Reprod Eertil 100: 447-449, 1994) and maintained at 4lOC 
until microinjection. 

25 Cytoplasmic injection of artificial chi-omosomes was achieved using the 

microinjection apparatus disclosed in US Patent Application No. 09/919,143, filed 
July 31, 2001. Chromosomes were injected into the Stage I embryos at a single site. 
Each embryo was cytoplasmically injected with approximately: 175, 250, 350, 450, 
550, 800 or >1000 chromosomes. The chromosomes were injected in a suspension of 

30 25-100 nanoUters (nl) of injection buffer. 
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Following microinjection, the embryos were transferred to the oviduct of 
recipient hens using an optimized ovum transfer (OT) procedure (Olsen, M and Neher, 
B. J Exp Zool 109: 355-66, 1948), with the exception that the hens were anesthetized 
by Isofluoranc gas. Typically, about 26h after O'l', the recipient hens lay a hard shell 
5 egg containing the manipulated ovum. Eggs were incubated for 2 1 days in a regular 
incubator until hatching of the birds. 

The chromosomes were injected into the embryos over a 9 day period. The 
chromosomes were divided into three batches for delivery to the embryos each batch 
being injected over a three day period. (Chromosomes were introduced into the 
10 embryos by a single injection using the microinjection assembly disclosed in the '143 
patent application. Following injection, each egg was transferred to a recipient hen. A 
total of 301 transfers were performed, resulting in 226 (75%) hard shells and 87 
hatched chicks (38%, sec Table 2). 

15 Table 2: Hatching of embryos microinjected with satellite DNA-based artificial 
chromosomes. 





Ovum 


Hard shells 


hatched birds 




transfers 


produced 




batch 


71 


53 


15 


batch 


113 


80 


33 


3'<i batch 


117 


93 


39 


Totals 


301 


226 (75%) 


87 (38%) 



Previous experiments have determined that hatching is not significantly 
20 affected when embryos were injected with up to lOOnl of injection buffer. Satellite 
DNA-based artificial chromosomes were injected in suspensions of between 25-lOOnl 
of injection buffer. 

As discussed, the embryos were injected with one of seven different numbers 
of artificial chromosomes. There was shown to be a correlation between the number 
25 of chromosomes injected per egg and the hatch rate. AU transchromosomic birds in 
the present study were obtained from embryos injected with 550 chromosomes or less 
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(see Table 3). There was no significant difference in the hatching rates observed 
between the experimental groups (batches 1^2 and 3). 

Six transchromosomic founders were produced based on two separate PGR 
analysis (6.8%, see Table 3) using primers which anneal to the puromycin resistance 
5 gene (about 75 copies of the pur^ gene are present on the chromosome. All positive 
birds appear normal. 



Table 3 : Effect of the number of Chromosomes injected per embryo on hatching 



# chromosomes 


# of hard 


# chicks 


# of positive 


injected 


shells 


hatched 


birds (bird tag #) 


per embryo 








175 


31 


1 1 (35%) 


3 (BB7478, BB7483, BB7515) 


250 


51 


25 (49%) 


1 (BB 7499) 


350 


15 


6 (40%) 


0 


450 


31 


11 (35%) 


0 


550 


39 


17 (43%) 


2 (BB7477, BB7523) 


800 


26 


5(19%) * 


0 


1000 


33 


10 (30%) * 


0 


Totals 


226 


87 (38%) 


6 (6.8%) 



10 *: hatching rates of embryos injected with >550 chromosomes was significantly lower (p < 
0.025) 



To confirm the PGR results, erythrocytes from all PGR-positive birds as well 
15 as fibroblast cells derived from skin biopsies of 5 PGR-positive birds were analyzed by 
interphase and metaphase FISH using a mouse-specific major satelhte DNA probe 
(Go, ct al. Ghromosomc Res 8: 183-191, 2000). Five of the six chicks (5.3% out of 
total number of chicks analyzed) tested by FISH were positive in at least one cell type 
(see Table 4) at 3 weeks of age. FISH analysis of erythrocytes was repeated when the 
20 birds reached 8 weeks of age and had tripled their body weight. Similar numbers of 
artificial chromosome-positive cells found in each bird were observed in this second 
FISH analysis. 



25 
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Table 4: Summary of FISH analysis of Red Blood Cells (RBCs) and fibroblast 
cells derived from transchromosomic birds. Fibroblast cells from hen # 7515 were 
not available for analysis. 



Rird # 


Sex of 


/V %Jl ctl I'lllCICtl 


/V \Jl ctl I'lI.IdCtI 




Bird 


chromosome 


chromosome positive 






positive 


fibroblasts by FISH 






RBCs by FISH 




BB7499 


Female 


77% 


87% 


BB7483 


Female 


0.8% 


0% 


BB7477 


Male 


3% 


2.8% 


BB7478 


Male 


15% 


3% 


BB7515 


Female 


1.3% 


NA 


BB7523 


Male 


0% 


0% 


Neg. conttol 




0% 


0% 



5 

To verify the chromosomes were intact, metaphase spreads from fibroblast 
cells derived from founders were made as described previously ((larside and Ilillman 

10 (1985) Experientia 41: 1183-1184). FISH analysis of metaphase spreads using the 
major satellite DNA probe showed the artificial chromosomes appear intact, with no 
apparent fragmentation or translocation onto the chicken's chromosomes. FISH 
analysis using a mouse minor satelUte probe, which detects the centromeric region of 
the introduced chromosomes (Wong and Rattner (1988) J. Nucleic Acids Res 16: 

15 11645-11661), demonstrated the centromere of the chromosomes was intact. 
Furthermore, the percentage of satellite DNA-based artificial chromosomes -positive 
cells from metaphase spreads agreed closely to those observed in interphase FISH. 

Analysis of Gl embryos from test bird BB7499 has shown the artificial 
chromosome to be transmitted through the germline. In addition, sperm from BB7499 

20 was shown to test positive for the artificial chromosome which will also provide for 
germline transmission of the artificial chromosome. 
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Example 22: Production of _EPO_ and G-CSF Vectors for the Production of 
Transchromosomic Chickens 

Two vectors were constructed for introduction into Satellite DNA-based 
artificial chromosomes. 10MC24-IRES1-EPO - CliromattB was constructed by 
5 inserting an EPO coding sequence into an ()M(]24-IRES BAC^ clone disclosed in US 
Patent Application No. 10/856,218, filed May 28, 2004, the disclosure of which is 
incoiporated in its entirety herein by reference. The EPO coding sequence was 
inserted in the clone so as to be under the control of the ovomucoid promoter. That is, 
the EPO coding sequence was inserted in place of the LC portion of OMC-IRES-LC. 

1 0 An attB site and a hyrgromycin"'' coding sequence were also inserted into the vector in 
such a manner as to facilitate recombination into an attP site in a SATAC artificial 
chromosome (i.e., ACE), as see in FIG. 25. The attP site in the SATAC is located 
adjacent to an SV40 promoter which provides for expression of the hygromycin^* 
coding sequence upon integration of the vector into the attP site allowing for selection 

15 of cells conlaining a recombinant artificial chromosome (see, for example, US Patent 
No. 6,743,967, issued June l. 2004; US Patent No. 6,025,155, issued February 15, 
2000 and Lindenbaum et al Nucleic Acids Res (2004) vol 32 no. 2 1 el72 (see FIG. 
25), the disclosure of each of these two patents and the publication are incorporated in 
their entirety herein by reference). 

20 A coding sequence for G-CSF, which was codon optimized for expression in 

chicken tubular gland cells, was inserted in the 10MC24-IRES1-EPO - ChromattB 
construct in place of the EPO coding sequence to produce 10MC24-IRES-GCSF - 
("hrommattB. 

25 Example 23; Production of Erythropoietin and G-CSF Using Artificial 

Chromosomes in Chickens 

Cecils containing the recombinant artificial chromosome are produced and 
identified as described in Lindenbaum et al Nucleic Acids Res (2004) vol 32 no. 2 1 
el72. Briefly, 2.5 |Xg of 10MC24-IRES1-EPO ChromattB and 2.5 (Xg of an 
30 expression vector which contains a lambda integrase gene (int) having a codon 
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mutation at position 174 to substitute a lysine for a glutamine (pCXLamROK, see 
Lindenbaum et al Nucleic Acids Res (2004) vol 32 no. 2 1 el72) are transfected by 
standard lipofection methodologies into LMTK- cells which contain the platform 
vSATAf] (A(^E) (A of riG. 25). DNA comprising the nucleotide sequence of interest, 
5 in this case 10MC24-1RES 1-EPO CliromattB, that has been highly purified, for 
example, utilizing a CsCl gradient centrifugation as is well known in the art, is 
particularly useful, though not required. Hygromycin resistant cells clones are 
identified by standard antibiotic selection methodologies. 

Recombinant chromosomes are prepared from the cells and isolated by flow 
10 cytometry. The substantially purified artificial chromosomes are introduced into 
chickens by microinjection into stage one embryos as disclosed in US Patent 
Application Nos. 10/679,034, filed October 2, 2003 and 09/919,143, filed July 31, 
2001. Resulting chimeric germline transchromosomal avians can be identified by any 
useful method such as Southern blot analysis. 

15 

Example 24; Production of a Monoclonal Antibody Using Drosophila Artificial 
Chromosomes in Turkey 

Artificial chromosomes comprising a Drosophila chromosome centromere 
(DAC) are prepared essentially using methods described in US Patent No. 6,025,155, 
20 issued February 15, 2000, the disclosure of which is incorporated in its entirety herein 
by reference. 

An attB site and a hyrgromycin^ coding sequence are inserted into the 
OMC24-IRES-LC and OMC24-IRES-HC vectors disclosed in US Patent Application 
No. 10/856,218, filed July 31, 2001, the disclosure of which is incorporated in its 
25 entirety herein by reference, which are then each cloned into a DAC essentially as 
described in Examples 22 and 23. The recombinant DACs arc prepared and then 
isolated by a dual laser high-speed flow cytometer. 

The flow-sorted chromosomes are pelleted by centrifugation and are diluted to 
a concentration of about 7000-12,000 chromosomes per |H of injection buffer. 
30 Approximately 50 nanolitcrs (nl) of injection buffer is injected per turkey embryo. 
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Embryos for this study are collected from actively laying commercial turkeys. 
Embryo donor turkeys are inseminated weekly using pooled semen from male turkeys 
of the same breed to produce eggs for injection. 

On the day of egg collection, fertile hens are eutheinized 2h post oviposition by 
5 cervical dislocation. The recently ovulated and fertilized eggs are collected from the 
upper magnum region of the oviduct under sterile conditions and placed in a glass well 
and covered with Ringers' Medium and maintained at about 40PC until microinjection. 

Cytoplasmic injection of artificial chromosomes containing the OMC24-IRES- 
LC is achieved using the microinjection apparatus disclosed in US Patent Application 
10 No. 09/919,143. Approximately 500 chromosomes are injected into the Stage I 
embryos at a single site. 

Following microinjection, the embryos are transferred to the oviduct of 
recipient turkeys essentially as described in Olsen et al, B. J Exp Zool 109: 355-66, 
1948. Typically, about one day after OT, the recipient turkeys lay a hsird shell egg 
15 conLaining Ihe manipulated ovum. Eggs are incubated in an incubator until hatching 
of the birds. 

G2 transchromosomal turkeys are obtained which contain the artificial 
chromosome in their genome. The artificial chromosome containing the OMC24- 
IRES-HC is introduced into embryos obtained from the G2 turkeys in essentially the 

20 same manner as described for the OMC24-IRES-LC. 

Eggs from Gl transchromosomal turkeys which contain both the OMC-IRES- 
LC and OMC24-IRES-HC containing chromosomes in their genome are tested for the 
presence of intact functional monoclonal antibody. A Costar flat 96-well plate is 
coated with 100 |al of C GoaL-anLi-Human kappa al a concentration of 5 |J.g/ml in PBS. 

25 The plate is incubated at 37 for two hours. 200 |ll of 5% PBA is added to the wells 
followed by an incubation at 37 °C for about 60-90 minutes followed by a wash. 100 
[il of egg white samples (diluted in 1% PBA:LBP) is added to each well and the plate 
is incubated at 37 °C for about 60-90 min followed by a wash. 100 Jil of a 1:2000 
dilution of F2 Goat anti-Human IgG Fc-AP in 1% PBA is added to the wells and the 

30 plate is incubated at 37 for 60-90 min followed by a wash. The antibody is 
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detected by placing 75 jil of Img/ml PNPP (p-nitrophenyl phosphate) in 5x developing 
buffer in each well and incubating for about 10-30 mins at room temperature. The 
detection reaction is stopped using 75ul of IN NaOH. The egg white tests positive for 
significant levels of the antibody. 

5 

Example 25: ^Production of Interferon Using Avian Artificial Chromosomes in 
Quail 

Artificial chromosomes comprising a chicken (Barred-Rock) chromosome 
centromere (CAC) are prepared essentially using methods described in US Patent No. 
10 6,743,967, issued .Tune 1, 2004, the disclosure of which is incorporated in its entirety 
herein by reference. 

A coding sequence for interferon alpha 2b disclosed in US Patent Application 
No. 10/463,980, filed June 17, 2003, the disclosure of which is incorporated in its 
entirety herein by reference, is inserted in the 10MC24-IRESl-Epo - ChromattB 
15 construct disclosed herein in I'^xaniple 22 in place of the \lVO coding sequence to 
produce 10MC24-1RES-1NF - (^hrommaLLB. The 10MC24-1RES-1NF - ChrommauB 
is cloned into the CACs esscniially as described in Example 23. The recombinant 
CACs are prepared then isolated by a dual laser high-speed flow cytometer. 

The flow- sorted chromosomes are pelleted by centrifugation and are diluted to 
20 a concentration of about 10,000 chromosomes per [\1 of injection buffer. 
Approximately 50 nanoUters (nl) of injection buffer is injected per quail embryo. 

Embryos for this study are collected from actively laying quail. Embryo donor 
quail are inseminated weekly using pooled semen from male quail of the same breed to 
produce eggs for injection. 
25 On the day of egg collection, fertile quail ai-e euthanized 2h post oviposition by 

cervical dislocation. The recently ovulated and fertilized eggs are collected from the 
upper magnum region of the oviduct under sterile conditions and placed in a glass well 
and covered with Ringers' Medium and maintained at about40PC until microinjection. 
Cytoplasmic injection of artificial chromosomes is achieved using the 
30 microinjection apparatus disclosed in US Patent AppUcation No. 09/919,143, filed 
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July 31, 2001. Chromosomes are injected into the Stage I embryos at a single site in 
each embryo. 

Following microinjection, the embryos are transferred to the oviduct of 
recipient quail essentially as described in Olsen et al, B. J Exp Zool 109: 355-66, 1948. 
5 Typically, about one day after OT, the recipient quail lay a hard shell egg containing 
the manipulated ovum. Eggs are incubated in an incubator until hatching of the birds. 

Eggs from G2 transchromosomal quail test positive for the presence of intact 
functional interferon alpha 2b. 

10 Example 26; Production of Monoclonal Antibody Using Avian Artificial 
Chromosomes in Chicken 

An attB site and a hyrgromycin^ coding sequence are inserted into the 
OMC24-IRES-LC and OMC24-IRES-HC vectors disclosed in US Patent AppUcation 
No. 10/856,218, filed July 31, 2001, which are then each cloned into CACs of 
15 Example 25 essentially as described in Examples 22 and 23. I'hc CACs are isolated 
by a dual laser high-speed flow cytometer. 

The flow-sorted chromosomes are pelleted by centrifugation and are diluted to 
a concentration of 7000-12,000 chromosomes per jil of injection buffer. 
Approximately 50 nanoliters (nl) of injection buffer is injected per chicken embryo. 
20 Embryos for this study are collected from actively laying G. gallus. Embryo 

donor chickens are inseminated weekly using pooled semen from male chickens of the 
same breed to produce eggs for injection. 

On the day of egg collection, fertile hens are euthanized 2h post oviposition by 
cervical dislocation. The recently ovulated and fertilized eggs arc collected from the 
25 upper magnum region of the oviduct under sterile conditions and placed in a glass well 
and covered with Ringers' Medium and maintained at about 4 l^C until microinjection. 

Cytoplasmic injection of artificial chromosomes containing the OMC24-IRES- 
LC is achieved using the microinjection apparatus disclosed US Patent Application 
No. 09/919,143. Approximately 500 chromosomes are injected into the Stage 1 
30 embryos at a single site. 
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Following microinjection, the embryos are transferred to the oviduct of 
recipient chickens essentially as described in Olsen et al, B. J Exp Zool 109: 355-66, 
1948. Typically, about one day after OT, the recipient chickens lay a hard shell egg 
containing the manipulated ovum. Eggs are incubated in an incubator until hatching 
5 of the GO birds. 

G2 transchromosomal chickens are obtained which contain the artificial 
chromosome in their genome. The artificial chromosome containing the OMC24- 
IRES-HC is introduced into embryos obtained from the G2 chickens in essentially the 
same manner as described for the OMC24-IRES-LC. 

10 Eggs from Gl transchromosomal chickens which contain both the OMC-IRES- 

LC and OMC24-IRES-HC in their genome are tested for the presence of intact 
functional monoclonal antibody. A Costar flat 96-wcll plate is coated with 100 ul of C 
Goat- anti -Human kappa at a concentration of 5 |J.g/ml in PBS. The plate is incubated 
at 37 OC for two hours. 200 |J.l of 5% PEA is added to the wells followed by an 

15 incubation at 37 °C for about 60-90 minutes followed by a wash. 100 ul of egg white 
samples (diluted in 1% PBA:LBP) is added to each well and the plate is incubated at 
37 OC for about 60-90 min foUowed by a wash. 100 ul of a 1:2000 dilution of F2 
Goat anti-IIuman IgG Fc-AP in 1% PBA is added to the wells and the plate is 
incubated at 37 °C for 60-90 min foUowed by a wash. The antibody is detected by 

20 placing 75 ul of Img/ml PNPP (p-nitrophenyl phosphate) in 5x developing buffer in 
each well and incubating for about 10-30 mins at room temperature. The detection 
reaction is stopped using 75ul of IN NaOH. The egg white tests positive for 
significant levels of the antibody. 

25 Example 27; Cell culture and transfection for the production of an insert 
containing artificial chromosome and screening for positive clones 

pK161 is a cosmid containing a 8.2 kb mouse rDNA insert. The plasmid is 
produced as disclosed in Csonka et al 2000, Journal of Cell Science 113, 3207-3216. 
100 JXg of cosmid pK161 is digested with Cla I, purified by phenol/chloroform 
30 extraction and ethanol precipitation then resuspended at approximately 1 fJ-g/fJl in TE, 
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pH 8.0. YAC DNA containing the human light-chain £ind heavy-chain 
immunoglobulin loci shown in Fig 27A and 27B are prepared as disclosed in Example 
30. 

LMTK- cells (obtained from ATCC) are cultured at 37°C in 5% C02 in a 
5 humidified incubator in DMEM (Invitrogen), 10% FBS (Ilyclon) (LMTK- media). 
Prior to the day of transfection, ten 10 cm plates are seeded with approximately 2 x 10^ 
cells per dish. Transfection with ExGen 500 (i.e., sterile 0.01 mM 22 kDa 
polyethylenimine (PEI), Fermentas Life Sciences) can be performed according to the 
manufacturers instructions or as foUows. 

10 On the day of the transfection, LMTK- cells are washed once with 3 ml of 

Optimem and the media is replaced with 6 ml of Optimem. In an eppendorf tube, 250 
(Xl of HBS (150 mM NaCl, 20 mM HEPES, pH 7.4) is mixed with 3.6 (il of ExGen 
500. In a second tube, 250 |il of HBS is mixed with 6 |lg of linearized pFK161, 3.0 |lg 
of gel-purified kappa light chain YAC and 3.0 |J,g of gel-purified heavy chain YAC. 

15 The PLl (lix(icn) mixture is added dropwise to the DNA mixture, without mixing of 
the two solutions. 

After incubation at RT for 10 min, the solution is gently mixed by pipeting up 
and down with a widc-borc pipct 3 times. 50 Jil of the transfection mix is added to 
each 10 cm dish of LMTK- cells and the plates are swirled to distribute the DNA/PEI 
20 complexes. 4-6 hours post-transfection, the media is replaced with 10 ml of LMTK- 
media. 48 hours post-transfection, the media is replaced with LMTK- media plus 200 
JXg/ml G418 (Geneticin, Invitrogen). The selective media is replaced every 2-3 days 
until colonies are apparent. 

Fifty G418-resistant colonies are isolated with cloning cylinders and are 
25 transferred to single wells in 24-well tissue culture plates. When the clones are at or 
near confluency, they trypsinized and split into three 24-well plates. 

To determine which clones carry a desired artificial chromosome, metaphase or 
interphase FISH is performed. Purified light chain YAC DNA is labeled with biotin- 
14dCTP by random priming (Bioprime DNA labeling system, Invitrogen). The heavy 
30 chain YAC DNA is labeled with digoxigenin-1 IdUTP by random priming (Dig High 
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Prime, Roche Diagnostics). The heavy and Ught chain YAC probes are mixed and 
hybridized to metaphase chromosomes or interphase nuclei. The hybridized biotin 
signals are made visible with fluorescein labeled avidin, and the digoxigenin signals 
are visualized with rhodamine labeled anti-digoxigenin antibody following standard 
5 protocols. The nuclei or chiomosomes are counterstained with DAPl and visualized 
on an Olympus 1X70 microscope configured with DAPI, FITC and rhodamine 
fluorescent excitation filters. 

Two clones are found to have an episomal element indicative of an artificial 
chromosome. Both clones are positive for the heavy and light chain YACs, indicating 
10 that both YACs are incorporated into the artificial chromosomes. The artificial 
chromosomes are believed to be satelhte artificial chromosomes. 

Example 28: Copy number determination of ig loci inserts and determination of 
structural integrity of the loci in the artificial chromosomes 

15 In order to simplify the interprclalion of the analysis of structural inlegriLy of 

the Ig containing YACs, it is desirable to obtain artificial chromosomes which carry 
one copy of each YAC. Real time PCR using Taqman® chemistry is utilized to 
identify clones containing a single copy of the YACs. Several primer/probe sets are 
designed to detect each YAC. The aniplicon detection probes are labeled using FAM 

20 as the dye and TAMRA as the quencher. 10 ng of genomic DNA purified from the 
positive clones that are identified in Example 30 are assayed in a 30 Jil reaction using 
the TaqManCR) Fast Universal PCR Master Mix, No AmpEraseCR) UNCI and 79()()HT 
(Applied Biosystems). AmpUfication curves are compared to standards that are 
composed of differing amounts of purified YACs in the presence of 10 ng of LMTK- 

25 DNA. Both positive clones of Example 27 appear to have a single copy of the light 
chain YAC as is indicated by overlap of the amplification curves and the Ct value 
relative to the standard curve. One clone appears to have two or more copies of the 
heavy chain YAC as the amplification curve had a Ct that is 4 cycles less than the 
other clone. The other clone appears to have a single copy of the heavy chain YAC. 

30 The clone containing one copy of each YAC (clone SC) is selected for further analysis. 
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PCR primers are designed to amplify 300-500 bp regions of each YAC which 
are complementary to restriction fragments to be detected in the Southern blot 
analysis. PCR products are gel purified and quantitated by the Picogreen Assay 
(Molecular Probes). Radiolabeled probes are generated by random priming using 
5 dcoxycytidinc 5'-La-32PJ triphosphate and the Rcdiprimc 11 Random Priming Jdt 
(Amersham). 

Cells of clone SC are embedded in agarose plugs and subjected lo DNA release 
and restriction digestion according to standard protocols. Several enzymes that cut the 
YACs into 20 to 150 kb segments are used including Asc I, Pac 1 and Sbf I. The 

10 digested plugs are loaded in multiple lanes such that rephcate membranes can be cut 
from a single membrane. The digested DNAs are separated by PFGE (CHEF) on a 
0.8% agarose gel in TAE buffer (switch time I = I s, switch time 2 = 25 s, 4 V/cm, 15 
to 20 h, 14°C). The gel is transferred to a UV crosslinker (Stratagene) and exposed to 
120 mJ UV radiation. The gel is denatured in 1.5 M NaCl, 0.5 M NaOH for 30 

15 minutes at RT and neutralized in 1.3 M NaCl, 1.0 M Tris base, pll 7.4 for 40 minutes 
at RT. The gel is transferred by capillary action Lo Genescreen Plus® nylon membrane 
in lOX SSPIi for one to three days. I'hc membrane is briefly rinsed in 2X SSPli and 
cross-linked with 120 mJ UV radiation (Stratagene). The membrane is cut into 
replicate pieces and is transferred to roller bottles (Bellco). The membranes are 

20 prehybridized in hybridization buffer ( 1.25X SSPE3 0.625% SDS, 40% formamide, IX 
Denhardts, 10% dextran sulfate, 0.05 mg/ml denatured salmon sperm DNA) for 2-6 
hours at 42°C. The hybridization buffer is changed with new buffer and the 
appropriate probe is added. The membranes are hybridized overnight at 42°C. The 
next day the membranes are washed with 0.2X SSPE, 1% SDS or 0.02X SSPE, 1% 

25 SDS at 42°C to 65°C until the CPM of each membrane is 400 or less. Membranes are 
wrapped in Saran Wrap® and exposed one to three days to BioMax MS^m film with a 
BioMax TranScreen he™ intensifying screen at — 80°C. Clone SC is found to have 
restriction fragments which demonstrate the structural integrity of both YACs; i.e., no 
rearrangement of the YACs is apparent. 

30 
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Example 29; Purification of Ig loci containing artificial chromosome and analysis 
of human immunoglobulin produced in transgenic avians 

Artificial chromosomes are purified from clone SC by flow cytometry and are 
used for cytoplasmic injections of stage T White leghorn embryos essentially as 
5 disclosed in Example 21. 500 embryos are injected with between 100 and 1000 
artificial chromosomes. 135 chicks hatch and are analyzed for the presence of the 
transgene in their blood DNA. DNA is extracted as disclosed in US Patent No. 
6,423,488, issued July 23, 2002. 100 ng of DNA is analyzed by real-time PCR using 
probes to detect the heavy and light chain YACs as disclosed in Example 31. Five 

10 birds are found to be positive for the clone SC artificial chromosome at significant 
levels (> 1 copy of the artificial chromosome for every 100 genomic equivalents). 

Scrum from hatched birds and eggs from mature hens arc analyzed for human 
TgA, and IgEc levels by RIXSA. vSeveral birds are positive for both human IgA. and Igl c 
in their serum, indicating that human IgG is produced in the serum. Eggs from GO 

15 hens are collected and the yolks removed. Yolk is diluted and analyzed for human IgA, 
and IgFc levels by ELISA. Several hens contain human IgG in the yolk of their eggs. 

Gl birds are produced from the GO birds as disclosed herein. Each of the 
positive Gl birds include the artificial chromosome in substantially all of their somatic 
cells as demonstrated by FISH. The germhne transgenic Gl birds produce substantial 

20 quantities of polyclonal antibodies which are deposited in the egg. For example, 
human polyclonal antibody is present in an amount greater than about 10 Ji.g/cgg or 
greater than about 0.1 mg/egg. 

Example 30; Isolation and characterization of human immunoglobulin loci YACs 
25 Two YACs that contain substantial portions of the human light-chain and 

heavy-chain immunoglobulin loci are shown in Fig. 27A and 27B. These constructs 
contain multiple variable, D, J and constant regions, as well as elements required for 
gene expression, gene rearrangement and constant chain switch. The lambda light- 
chain construct, IgLambda, is a 410 kb YAC that has been previously used to express 
30 human polyclonal antibodies in transgenic mice. See, for example, US patent 
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applicalion No. 2004/0231012, published November 18, 2004 and Popov el al (1999) 
J. Exp. Med. 189:1611-1619, the disclosures of which are incorporated in their 
entireties herein by reference. The heavy-chain construct, IgHeavy-2, is a 300 kb 
derivative of the YAC shown in Fig. 27A that has been used to express human 
5 polyclonal in mice (Nicholson et al (1999) J Immunol 163:6898-6906) to which a 
functional human gamma-constant gene segment has been added 3' of the C5 region. 

YAC containing strains of Saccharomyces cerevisiae were grown in a yeast 
nitrogen base medium with 2% glucose and an appropriate selective amino acid at 
3()"C' for 4 days. Total DNA agarose plugs were prepared from the yeast strains using 

10 the protocol of ladonato, S. P., and A. Gnirke (1996) modified as follows: 

Yeast cells were centrifuged, washed with 50 mM EDTA pH 8 and 
resuspended at 2 x 10^ cells/ml in 50 mM EDTA pH 8. The cell suspension was 
heated to 45-50"C and added to an equal volume of 2% LMP agarose that had been 
melted and brought to 45-50V-. Cells and agarose were mixed and dispensed into plug 

15 molds which were then placed at 4*^. Hardened plugs were placed in sphcroplasling 
solution ( 1 M sorbilal, 20 mM RDTA, 10 mM Tris-IlCl pll 7.5, 14 mM 
mercaptoclhanol, 3% lyticase solution (#170-3593 Bio-Rad)) at 37"C lor 4 hours with 
gentle agitation. Plugs were then washed in LDS solution (1% lithium dodecyl sulfate, 
100 mM EDTA pH 8, 10 mM Tris-HCl pH 8) for 15 minutes and were then placed in 

20 LDS solution for 16 hours at 370C with gentle agitation. Plugs were then washed 3 
times for 30 minutes in NDS solution (500 mM EDTA, 10 mM Tris base, 1% sarkosyl 
pH 9) and 5 times for 30 minutes with TE (10 mM Tris-HCl pH 8, 1 mM EDTA pH 8) 
with gentle agitation. 

The intact YACs were separated by contour-clamped homogeneous electric 

25 field (CHEF) electrophoresis in Wc low-melting point agarose gels using 0.5X TBE 
buffer at 14°C and a 30 second constant switch time at 5 V/cm for 36 hr. Gel slices 
containing the YAC of interest were equilibrated 2 hr with microinjection buffer 
containing 10 mM Tris-Cl pH 7.5, O.ImM EDTA pH 8.0, 100 mM NaCl, 30 mM 
spermine, and 70 mM spermidine. The gel slices were melted at 68°C for 20 min and 

30 then digested with GELase (5 U/100 mg) at 420C for 2 hr. Integrity of Each YAC 
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sample was then confirmed by CHEF electrophoresis on a 1.5% agarose gel with 0.5X 
TBE buffer at 14°C using a 30 second constant switch time and 5 V/cm for 24 hr. 

Example 31; Transgcncsis and immunoglobulin expression 
5 Purified heavy-chain and light-chain YA(^ DNAs prepared as disclosed in 

Example 3 1 were co-injected into early embryos to generate transgenic animals as 
essentially disclosed in Example 21. A volume of 50 nl of 110 pg chromosome DNA 
per |j.l of microinjection buffer was injected into each of several hundred embryos. 
Testing for the production of human light-chain in scrum of resultant chickens was 

10 performed using a human lambda ELISA quantitation kit (#E80-116) from Befhyl 
Laboratories (Montgomery, TX). In the procedure, both the capture antibody and 
detection antibodies were diluted 1:2000. Quantitation of antibody containing 
associated light-chain and heavy-chains was performed by replacing the detection 
antibody in the above kit with an alkaline phosphatase-conjugated goat anti-human 

15 IgG, Fc gamma-antibody (diluted 1:2000) (#109-056-098, Jackson ImmunoRcscarch 
Laboratories, Inc., West Grove, PA) and followed by detection using a TMB substrate. 
At least one bird was shown to express human immunoglobuUns by ELISA (Table 5) 
in the scrum. 

20 Table 5 



Bird 


Lambda Light-Chain 


Whole IrGX 


#6946 


26 ttg/ml 


24 ng/ml 


Control 


0 


0 



Example 32 

25 Identification of Target Sequences and Probe Preparation 

A 234bp sequence of mouse major satellite DNA sequence (SEQ ID NO: 14) 
present in SATACs (see, for example, US patent publication No. 2003/0119104, filed 
May 30, 2002, the disclosure of which is incorporated in its entirety herein by 
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reference) was scanned for 6mer repeat regions. Two such sequences were identified, 
each present five times in the 234bp sequence. These consensus sequences can be 
targets for labeled polyamide probes. The identified sequences are shown in Table 6. 

5 SEQ ID NO: 14, Mouse Major Satellite DNA sequence 

gacctggaatatcgcgagtaaactgaaaatcacggaaaatgagaaatacacactttaggacgtgaaatatggcgaggaaaa 
ctgaaaaaggtggaaaatttagaaatgtccactgtaggacgtggaatatggcaagaaaactgaaaatcatggaaaatgagaa 
acatccacttgacgacttgaaaaatgacgaaatcactaaaaaacgtgaaaaatgagaaatgcacactgaaggac 

10 Table 6 



|l\iUcrn|Searched|( Onscnsus 




Itgaaaa 


TGAAAA 


5 


GAAAAT 


GAAAAT 


5 



The centromeric region of SATACs contain copies of the mouse minor satellite 
sequence which can also be targeted. For example, polyamide probes can be designed 
15 to target pentameric repeat sequences and octameric repeat sequences located within 
the 120 bp sequence of the mouse minor satellite (SEQ ID NO: 15). The consensus or 
repeat sequences identified are shown in Table 7 . 

SEQ ID NO: 15, Mouse minor satelhte DNA Sequence 

20 gagtgagttacactgaaaaacacatacgttggaaaccggcattgtagaacagtgtatatcaatgagttacaatgagaaaaatg 
gaaaatgataaaaaccacagtgtagaacatattagatgtgtgagttacactgaaaaacacattccttggaaacgggatttgtag 
aactgtgtatatcaatgagttacaatgagaaacatggaaaatgataaaaaccacactgtagaacattttagatgagtgagttaca 
ctgaaaaacacatatgttggaaa 

25 
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Table? 



j Pattern SeiavchbdJ 


Consensws 


Copy No 


|gaaaa 


GAAAA 


6 


AAAAA 


AAAAA 


6 


AATGA 


AATGA 


6 


TGAGTTAC 


TGAGTTAC 


5 


GAGTTACA 


GAGTTACA 


5 



The heterochromatic region of SATACs contain copies of the mouse rDNA 
5 (ribosomal RNA encoding DNA) sequence can also be targeted. For example, 
polyamide probes can be designed to target 5-mer repeat sequences, six-mer repeat 
sequences and seven-mer repeat sequences located within the 120 bp sequence of 
mouse rDNA shown in SEQ ID NO. 16. The consensus or repeat sequences are 
shown in Table 8. 

10 

Table 8 



Pattern ISearchedl IConsensus llCopy |NoI 


TGTGC 1 


TGTGC 6 


TTCCC 1 


TTCCCJL 6 


CGTGC 1 


CGTGC L 8 


CCGCC 1 


CCGCC L 2 1 


CGCCG 


CGCCG i[_ 25 


CCCGCG 


CCCGCG IL 15 


CCCGTC 


CCCGTC IL 5 


j CCGGCG 1 


CCGGCG I 7 


1 CCCGGG 


CrCG(-GrT. 5 


1 TCTCTCG 


TCTCTCGJI 6 
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Polyamide probes can be constructed to recognize the repeat sequences shown 
in Tables 6, 7 and 8 and/or other sequences contained in artificial chromosomes that 
wiU provide for a facihtated isolation of the artificial chromosomes, for example, by 
flow cytometry. Methods of making polyamide probes are well known in the art and 

5 are disclosed, for example, in the certain references cited herein and in the 
specification. 

Example 33 

Preparation and flow sorting of chromosomes 

10 Chromosome suspensions are prepared from CHO cells (e.g., chromosome 

suspensions containing artificial chromosomes such as those disclosed in US patent 
pubhcation No. 2003/0119104, filed May 30, 2002) using a modification of the 
polyamine-based method described by SiUar and Young (1981) A new method for the 
preparation of metaphase chromosomes for flow analysis, J. Histochem. Cytochem., 

15 29, 74-78 and Lalande et al (1984) Development and use of metaphase chromosome 
flow- sorting methodology to obtain recombinant phage libraries enriched for parts of 
the human X chromosome, Cytometry, 5, 101-107, the disclosures of which arc 
incorporated herein in their entirety by reference. Briefly, cells cultured in RPMI 
medium containing 20% fetal bovine serum are arrested at mitosis by incubation in 0. 1 

20 ^ig/ml colcemid for about 16 h. The cells are collected by centrifugation, resuspended 
in 40 mM KCl for 10 min and then ccntrifugcd again. The pellet is resuspended in 
cold buffer containing 80 mM KCl, 20 mM NaCl, 15 mM Tris-HCl pH 7.2, 2 mM 
EDTA, 0.5 mM EGTA, 7 mM P-mercaptoethanol, 0.2 mM spermine, 0.5 mM 
spermidine and 0.12% digitonin, and incubated on ice for 10 min. The suspension is 

25 vortexed vigorously for 2 min and then stored for up to 90 days at 4°C before use. 

Prior to flow analysis, the chromosomes are stained for about 2 h with 1 fiM of 
fluorescently labeled polyamide probe and 2 Jlg/ml HO (Hoechst 33258). The 
polyamide probe targets one or more of the nucleotide sequences specified in Table 6, 
Table 7 and/or Table 8 and is produced essentially as disclosed in Dervan (2001) 

30 Molecular recognition of DNA by small molecules. Bioorg Med Chem 9 : 2215-35. 
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Sodium citrate and sodium sulfite are added to chromosomes 15 to 30 min 
before flow analysis at final concentrations of 10 and 25 mM, respectively, to improve 
chromosome resolution. 

Chromosomes are separated using an Influx flow sorter (Cytopeia, Inc., Seattle, 
5 WA). One laser is tuned to emit ultraviolet light (351-364 nm, 250 mW) to excite 
HO, and HO fluorescence is measured after passing through a 425-nm long-pass filter 
and a 458-nm rejection-band filter. A second laser is tuned to 458 nm (250 mW) to 
excite the fluorescently labeled polyamide probe. The fluorescently labeled polyamide 
probe fluorescence is mea.sured after passing through a 500-nm long-pass filter and a 
10 458-nm rejection-band filter. Alternatively, fluorescein fluorescence is measured 
following excitation at 488 nm (250 mW) after passing through a 530/40 band-pass 
filter. The fluorescence pulses from the individual chromosomes are integrated by a 
data acquisition system, and are collected in listmode at a rate of about 1000 
chromosomes per second. 

15 

Example 34 

Purification of SATACs Contained in Micronuclei 

Micronucleation of chromosomes in ChYl cells containing the artificial 
chromosome (e.g., SATACs) is induced by incubation for 72 h in the presence of 1 

20 ^g/ml of colchicine in growth medium. 

Micronuclei are isolated essentially as described in Labidi (Labidi, B, et al. 
Procedure for isolating micronuclei from rat kangaroo cultured cells containing 
individualized chromosomes. Eur J Cell Biol 38: 165-70, 1985), the disclosure of 
which is incorporated in its entirety herein by reference. Briefly, micronucleated cells 

25 are harvested by trypsin-EDTA treatment, rinsed twice in PBS, and resuspended in 2 
vol TKM buffer (10 mM Tris-HCl, pH 7.4, 10 mM KCl, and 3 mM MgC12) 
containing 0.05% coUagenase 1 A (Sigma). Cell lysis is accelerated by gentle shearing 
of the suspension through a 26-gauge needle and 1 mM phenyl methyl sulfonyl 
fluoride (PMSF) is added. Isolated micronuclei are collected by low-speed 

30 ccntrifugation (I500g) and washed twice with 4 vol of ins-polyaminc buffer (I'PB; 15 
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niM Tris-HCI, pH 7.4, 0.2 mM spermine, 0.5 niM spermidine, 2 niM EDTA, 0.5 mM 
EGTA, 80 mM KCl, 20 mM NaCl, and 14 mM B-mercaptoethanol). 

The micronuclei containing chromosomes are stained and purified by flow 
cytometry in essentially the same manner as described for the staining and flow 
5 cytometry purification of artificial chromosomes, as disclosed in Rxample 33. 

Diiring the flow cytometry, micronuclei collection is limited to the window in 
the fluorescence histogram where micronuclei containing a single SATAC are located, 
which can be defined by conventional methodologies. 

10 Example 35 

Production of Transchromosomic Chickens Using Satellite DNA-Based Artificial 
Chromosomes 

The flow-sorted artificial chromosomes of Example 33 or micronuclei 
containing the artificial chromosomes of Example 34 are pelleted by centrifugation of 

15 a 750|.l1 sample containing approximately 10^ chromosomes (arlificial chromosomes 
or micronuclei containing artificial chromosomes) at 2500 x g for 30 min at 4^C. The 
supernatant, except the bottom 30 microliters (|il) containing the chromosomes, is 
removed resulting in a concentration of about 7000 to 11,500 chromosomes per [ii of 
injection buffer (Monteith, et al. Methods Mol Biol 240: 227-242, 2004). 

20 Approximately 25 to 100 nanoliters (nl) of injection buffer is injected per embryo. 

Early stage embryos (e.g., stage I embryos) are collected from 24 to 36 week- 
old hens from commercial White Leghorn variety of G. gaUus. Embryo donor hens are 
inseminated weekly using pooled semen from roosters of the same breed to produce 
eggs for injection. 

25 On the day of egg collection, fertile hens are euthanized 2h post oviposition by 

cervical dislocation. Typically, oviposition is followed by ovulation of the next egg 
after about 24 minutes (Morris, Poultry Science 52: 423-445, 1973). The recently 
ovulated and fertilized eggs are collected from the upper magnum region of the 
oviduct under sterile conditions and placed in a glass well and covered with Ringers' 
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Medium (Tanaka, et al. J Reprod Fertil 100: 447-449, 1994) and maintained at 4 l^C 
until microinjection. 

Cytoplasmic injection of artificial chromosomes is achieved using the 
microinjection apparatus disclosed in US Patent Application No. 11/159,973, filed 
5 June 23, 2005, the disclosure of which is incorporated in its entirety herein by 
reference. Chromosomes are injected into the Stage T embryos at a single site. Each 
embryo is cytoplasmically injected with approximately 400 to 1000 chromosomes. 
The chromosomes are injected in a suspension of 25 to 100 nanoliters (nl) of injection 
buffer. 

10 Following microinjection, the embryos are transferred to the oviduct of 

recipient hens using the ovum transfer (OT) procedure of Olsen (Olsen, M and Neher, 
B. J Exp Zool 109: 355-66, 1948), with the exception that the hens are anesthetized by 
isofluorane gas. Typically, about 26h after OT^the recipient hens lay a hard shell egg 
containing the manipulated ovum. Eggs are incubated for 21 days in a regular 

15 incubatdr until hatching (li'thc birds. 

Transchromosomic founders axe identified based on PCR analysis and FISH 
analysis. Analysis of Gl embryos from a test bird show the artificial chromosome to 
be transmitted through the germlinc. 

While this invention has been described with respect to various specific 

20 examples and embodiments, it is to be understood that the invention is not limited 
thereto and that it can be variously practiced with the scope of the following claims. 
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What is claimed is: 

1. A method comprising: 
isolating an artificial chromosome; 

5 introducing the artificial chromosome into an avian embryo; 

maintaining the embryo under conditions suitable for the embryo to 
develop and hatch as a chick; and 

maintaining the chick under conditions suitable to obtain a mature 
avian wherein the artificial chromosome is present in the genome of the mature avian. 

10 

2. The method of claim 1 wherein isolating the artificial chromosome is 
facilitated by flow cytometry. 

3. The method of claim 2 wherein the flow cytometry is facilitated by a 
15 probe which is associated with the artificial chromosome. 

4 . The method of claim 3 wherein the probe is a polyamide probe. 

5. The method of claim 2 wherein the artificial chromosome is present in 
20 a micronuclei. 

6. 'ITie method of claim 1 wherein the artificial chromosome is introduced 
into the avian embryo by injection. 

25 7 . The method of claim 1 comprising transferring the embryo to a recipient 

female avian. 

8. The method of claim 1 wherein the embryo is an early stage embryo. 
30 9. 'ITie method of claim 1 wherein the embryo is a stage 1 embryo. 
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10. The method of claim 1 wherein the avian is a chicken. 

11. The method of claim 1 wherein the artificial chromosome comprises a 

5 heterologous recombination site. 

12. The method of claim 11 wherein the artificial chromosome comprises 
more than one heterologous recombination site. 

10 13. The method of claim 1 wherein the artificial chromosome comprises a 

heterologous coding sequence. 

14. 'ITie method of claim 13 wherein the heterologous coding sequence 
comprises a pharmaceutical protein coding sequence. 

15 

15. The method of claim 13 wherein the heterologous coding sequence 
encodes an immunoglobulin polypeptide. 

16. The method of claim 13 wherein the heterologous coding sequence 
20 encodes a cytokine. 

17. The method of claim 1 wherein the artificial chromosome comprises a 
promoter. 

25 18. The method of claim 1 wherein the artificial chromosome comprises a 

promoter which functions in tubular gland cells. 

19. The method of claim 1 wherein the artificial chromosome comprises an 

IRES. 

30 
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20. The method of claim 1 comprising obtaining an offspring from the 
mature avian wherein the offspring contains an artificial chromosome in its genome. 



2 1 . The method of claim 1 wherein the artificial chromosome is a stabilized 
artificial chromosome isolated from an avian cell. 



22. An avian produced by the method of claim 1. 



23 . A method comprising : 

isolating an artificial chromosome; 

introducing the artificial chromosome into an avian embryo by 

injection; 

maintaining the embryo under conditions suitable for the embryo to 
develop and hatch as a chick; and 

maintaining the chick under ctmdilitjns suitable to tjbtain a mature 
avian wherein the artificial chromosome is present in the genome of the mature avian. 



24. The method of claim 23 comprising transferring the embryo to a recipient 
female avian. 
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Fig. 4 
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Fig. 5 
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Fig. 6 
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pCMV-C31int (SEQ ID NO: 1) 

CATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATT 

ACGCCAGCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGGATCG 

ATCCAGACATGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAA 

AAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAA 

T AAACAAGT T AAC AACAACAATTGCATTCATTTT ATGTTT CAGGTT CAGGGGGAGGTGTGGG 

AGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTGGTATGGCTGATTATGATCATGAACAG 

ACTGTGAGGACTGAGGGGCCTGAAATGAGCCTTGGGACTGTGAATCTAAAATACACAAACAA 

TTAGAATCACTAGCTCCTGTGTATAATATTTTCATAAATCATACTCAGTAAGCAAAACTCTC 

AAGCAGCAAGCATATGCAGCTAGTTTAACACATTATACACTTAAAAATTTTATATTTACCTT 

AGAGCTTTAAATCTCTGTAGGTAGTTTGTCCAATTATGTCACACCACAGAAGTAAGGTTCCT 

TCACAAAGATCCCAAGCTAGCTTATAATACGACTCACTATAGGGAGAGAGCTATGACGTCGC 

ATGCACGCGTAAGCTTGGGCCCCTCGAGGGATCCGGGTGTCTCGCTACGCCGCTACGTCTTC 

CGTGCCGTCCTGGGCGTCGTCTTCGTCGTCGTCGGTCGGCGGCTTCGCCCACGTGATCGAAG 

CGCGCTTCTCGATGGGCGTTCCCTGCCCCCTGCCCGTAGTCGACTTCGTGACAACGATCTTG 

TCTACGAAGAGCCCGACGAACACGCGCTTGTCGTCTACTGACGCGCGCCCCCACCACGACTT 

AGGGCCGGTCGGGTCAGCGTCGGCGTCTTCGGGGAACCATTGGTCAAGGGGAAGCTTCGGGG 

CTTCGGCGGCTTCAAGTTCGGCAAGCCGCTCTTCCGCCCCTTGCTGCCGGAGCGTCAGCGCT 

GCCTGTTGCTTCCGGAAGTGCTTCCTGCCAACGGGTCCGTCGTACGCGCCTGCCGCGCGGTC 

TTCGTACAGCTCTTCAAGGGCGTTCAGGGCGTCGGCGCGCTCCGCAACAAGGTTCGCCCGTT 

CGCCGCTCTTCTCAGGCGCCTCAGTGAGCTTGCCGAAGCGTCGGGCGGCTTCCCACAGAAGC 

GCCAACGTCTCTTCGTCGCCTTCGGCGTGCCTGATCTTGTTGAAGATGCGTTCCGCAACGAA 

CTTGTCGAGTGCCGCCATGCTGACGTTGCACGTGCCTTCGTGCTGCCCAGGTGCGGACGGGT 

CGACCACCTTCCGGCGACGGCAGCGGTAAGAGTCCTTGATCGATTCTTCCCCGCGCTTCGAA 

GTCATGACGGCGCCACACTCGCAGTACAGCTTGTCCATGGCGGACAGAATGGCTTGCCCCCG 

GGAAAGCCCCTTGCCGCGCCCCCTGCCGTCCAACCACGCCTGAAGCTCATACCACTCAGCGG 

GCTCGATGATCGGTCCGCAATCAAGCTCGACCGGCCGGAGCGTGATCGGGTCGCGCTGAATG 

CGGTAACCCTCAATCTTCGTGGTCGGCGTGCCGTCCGGCTTCTTCTTGTAGATCACCTCAGC 

GGCGAAGCCCGCAATACGCGGGTCCCGAAGGATTCGCATAACGGTTGCCGGGTCCCAGGCGC 

TTGAAGCGGTCTTCTTCCCAATCGTCTCGCCCCGGGTCGGCACGGCGTCAGCGTCCATGCGC 

TTACAAAGCCCCGTGATGCTGCCCGGGTGAATGGCGGCTTGACTGCCCGGCTTGAAGGGAAG 

GTGTTTGTGCGTCTTGATCTCACGCCACCACCACCGGATTACGTCGGGCTCGAACTCGAAGG 

GTCCGGTAAGGGGAGTGGTCGAGTGCGCAAGCTTGTTGATGACGACATTGACCATTCGGCCG 

TTGCGCGTGATCTCCTTCGTCTCCGAAACAAGCTCGAAGCCGTAAGGCGCCTTCCCGCCGAC 

GTACCCGCCCAATTCGCGCTGAAGGTTCTTCGTGTCGAGAATCTTCGCCGACTTCAGCGAAG 

ATTCTTTGTGCGACGCGTCGAGCCGCATAATCAGGTGAATCAGGTCCATGACGTTTCCCTGC 

CGGAAGACGCCTTCCTGAGTGGAAACAATCGTCACGCCCAGGGCGAGCAATTCCGAGACAAT 

CGGAATCGCGTCCATGACCTTCAGGCGCGAGAAGCGCGACACGTCATAGACAATGATCATGT 

TGAGCCGCCCGGCGCGGCATTCGTTCAGGATGCGTTCGAACTCCGGGCGCTCCGCCGTCCCG 

AACGCCGACGTGCCCGGCGCTTCGCTGAAATGCCCGACGAACCTGAACCGGCCCCCGTCGCG 

CTCGACTTCGCGCTGAAGGTCGGCCGCCTTGTCTTCGTTGGCGCTACGCTGTGTCGCTGGGC 

TTGCTGCGCTCGAATTCTCGCGCTCGCGCGACTGACGGTCGTAAGCACCCGCGTACGTGTCC 

ACCCCGGTCACAACCCCTTGTGTCATGTCGGCGACCCTACGACTAGTGAGCTCGTCGACCCG 

GGAATTCCGGACCGGTACCTGCAGGCGTACCTTCTATAGTGTCACCTAAATAGCTTTTTGCA 

"AAAGCCTAGGCTAGAGTCCGGAGGCTGGATCGGTCCCGGTGTCTTCTATGGAGGTCAAAACA 

GCGTGGATGGCGTCTCCAGGCGATCTGACGGTTCACTAAACGAGCTCTGCTTATATAGACCT 

CCCACCGTACACGCCTACCGCCCATTTGCGTCAATGGGGCGGAGTTGTTACGACATTTTGGA 

AAGTCCCGTTGATTTTGGTGCCAAAACAAACTCCCATTGACGTCAATGGGGTGGAGACTTGG 

AAATCCCCGTGAGTCAAACCGCTATCCACGCCCATTGATGTACTGCCAAAACCGCATCACCA 

TGGTAATAGCGATGACTAATACGTAGATGTACTGCCAAGTAGGAAAGTCCCATAAGGTCATG 

TACTGGGCATAATGCCAGGCGGGCCATTTACCGTCATTGACGTCAATAGGGGGCGTACTTGG 

CATATGATACACTTGATGTACTGCCAAGTGGGCAGTTTACCGTAAATACTCCACCCATTGAC 

GTCAATGGAAAGTCCCTATTGGCGTTACTATGGGAACATACGTCATTATTGACGTCAATGGG 

CGGGGGTCGTTGGGCGGTCAGCCAGGCGGGCCATTTACCGTAAGTTATGTAACGACCTGCAC 
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GATGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACATTATACGAGCCGGAA 
GCTATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGilAAGGGCCTCGTATACGCCTATTTTT 
ATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATG 
TGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGA 
CAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGCGCGAATTGCAAGCTCTGCATTA 
ATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGC 
TCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCG 
GTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCA 
GCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCC 
CTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAA 
AGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCT 
TACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCT 
GTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCC 
GTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACA 
CGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCG 
GTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGT 
ATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAA 
ACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAA 
AAGGATCTCAAGAAGATCCTTTGATCTTTTGTACGGGGTCTGACGCTCAGTGGAACGAAAAC 
TCACGTTAAGGGATTTTGGTCATGCCATAACTTCGTATAGCATACATTATACGAAGTTATGG 
CATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAAT 
CAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCA 
CCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGAT 
AACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCAC 
GCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGT 
GGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAG 
TAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCAC 
GCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGA 
TCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAA 
GTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGC 
CATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGT 
ATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAG 
AACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTAC 
CGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTT 
ACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAAT 
AAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTT 
ATCAGGGTTATTGTCTCATGCCAGGGGTGGGCACACATATTTGATACCAGCGATCCCTACAC 
AGCACATAATTCAATGCGACTTCCCTCTATCGCACATCTTAGACCTTTATTCTCCCTCCAGC 
ACACATCGAAGCTGCCGAGCAAGCCGTTCTCACCAGTCCAAGACCTGGCATGAGCGGATACA 
TATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTG 
CCACCTGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAG 
CTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCG 
AGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCC 
AACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTA 
ATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCC 
GATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAA 
GGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGC 
CGCGCTTl^TGCGCCGCTACAGGGCGCGTC 
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pCMV-luc-attB (SEQ ID NO: 2) 

CTCTATCGATAGGTACCGAGCTCTTACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAA 
TCAATATTGGCAATTAGCCATATTAGTCATTGGTTATATAGCATAAATCAATATTGGCTATT 
GGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATA 
TGACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATT 
AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCT 
GACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 
ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGT 
ACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCG 
CCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTA 
TTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCG 
GTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGC 

accaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggc 
ggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgc 
ctggagacgccatccacgctgttttgacctccatagaagacaccgggaccgatccagcctcc 
cctcgaagctcgactctaggggctcgagatctgcgatctaagtaagcttggcattccggtac 
tgttggtaaagccaccatggaagacgccaaaaacataaagaaaggcccggcgccattctatc 
cgctggaagatggaaccgctggagagcaactgcataaggctatgaagagatacgccctggtt 
cctggaacaattgcttttacagatgcacatatcgaggtggacatcacttacgctgagtactt 
cgaaatgtccgttcggttggcagaagctatgaaacgatatgggctgaatacaaatcacagaa 
tcgtcgtatgcagtgaaaactctcttcaattctttatgccggtgttgggcgcgttatttatc 
ggagttgcagttgcgcccgcgaacgacatttataatgaacgtgaattgctcaacagtatggg 
catttcgcagcctaccgtggtgttcgtttccaaaaaggggttgcaaaaaattttgaacgtgc 
aaaaaaagctcccaatcatccaaaaaattattatcatggattctaaaacggattaccaggga 
tttcagtcgatgtacacgttcgtcacatctcatctacctcccggttttaatgaatacgattt 
tgtgccagagtccttcgatagggacaagacaattgcactgatcatgaactcctctggatcta 
ctggtctgcctaaaggtgtcgctctgcctcatagaactgcctgcgtgagattctcgcatgcc 
agagatcctatttttggcaatcaaatcattccggatactgcgattttaagtgttgttccatt 
ccatcacggttttggaatgtttactacactcggatatttgatatgtggatttcgagtcgtct 
taatgtatagatttgaagaagagctgtttctgaggagccttcaggattacaagattcaaagt 
gcgctgctggtgccaaccctattctccttcttcgccaaaagcactctgattgacaaatacga 
tttatctaatttacacgaaattgcttctggtggcgctcccctctctaaggaagtcggggaag 
cggttgccaagaggttccatctgccaggtatcaggcaaggatatgggctcactgagactaca 
tcagctattctgattacacccgagggggatgataaaccgggcgcggtcggtaaagttgttcc 
attttttgaagcgaaggttgtggatctggataccgggaaaacgctgggcgttaatcaaagag 
gcgaactgtgtgtgagaggtcctatgattatgtccggttatgtaaacaatccggaagcgacc 
aacgccttgattgacaaggatggatggctacattctggagacatagcttactgggacgaaga 
cgaacacttcttcatcgttgaccgcctgaagtctctgattaagtacaaaggctatcaggtgg 
ctcccgctgaattggaatccatcttgctccaacaccccaacatcttcgacgcaggtgtcgca 
ggtcttcccgacgatgacgccggtgaacttcccgccgccgttgttgttttggagcacggaaa 
gacgatgacggaaaaagagatcgtggattacgtcgccagtcaagtaacaaccgcgaaaaagt 
tgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacgca 
agaaaaatcagagagatcctcataaaggccaagaagggcggaaagatcgccgtgtaattcta 
gagtcggggcggccggccgcttcgagcagacatgataagatacattgatgagtttggacaaa 
ccacaactagaatgcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgcttta 
tttgtaaccattataagctgcaataaacaagttaacaacaacaattgcattcattttatgtt 
tcaggttcagggggaggtgtgggaggttttttaaagcaagtaaaacctctacaaatgtggta 
aaatcgataaggatcaattcggcttcaggtaccgtcgacgatgtaggtcacggtctcgaagc 
cgcggtgcgggtgccagggcgtgcccttgggctccccgggcgcgtactccacctcacccatc 
tggtccatcatgatgaacgggtcgaggtggcggtagttgatcccggcgaacgcgcggcgcac 
cgggaagccctcgccctcgaaaccgctgggcgcggtggtcacggtgagcacgggacgtgcga 
cggcgtcggcgggtgcggatacgcggggcagcgtcagcgggttctcgacggtcacggcgggc 
atgtcgacagccgaattgatccgtcgaccgatgcccttgagagccttcaacccagtcagctc 
cttccggtgggcgcggggcatgactatcgtcgccgcacttatgactgtcttctttatcatgc 
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AACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTC 

GGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAG 

AATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGT 

AAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAA 

TCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCC 

CTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCC 

TTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGT 

GTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCG 

CCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCA 

GCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAA 

GTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGC 

CAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGC 

GGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCC 

TTTGATGTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGG 

TCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAA 

TCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC 

ACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGA 

TAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCA 

CGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAG 

TGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAA 

GTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCA 

CGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATG 

ATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTA 

AGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATG 

CCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTG 

TATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCA 

GAACTTTAAAAGTGCTCATCATTGGAiVAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTA 

CCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTT 

TACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAA 

TAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATT 

TATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAAT 

AGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCATTAA 

GCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCC 

GCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCT 

AAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAAC 

TTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTT.TTCGCCCTTTG 

ACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCC 

TATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAl^AA 

ATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTCC 

CATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATT 

ACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAGCGG 

CCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAG 

TACTAACATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGCTGTC 

CCCAGTGCAAGTGCAGGTGCCAGAACATTT 
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pCMV-luc-attP (SEQ ID NO: 3) 

CTCTATCGATAGGTACCGAGCTCTTACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAA 

TCAATATTGGCAATTAGCCATATTAGTCATTGGTTATATAGCATAAATCAATATTGGCTATT 

GGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATA 

TGACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATT 

AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCT 

GACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 

ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGT 

ACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCG 

CCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTA 

TTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCG 

GTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGC 

ACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGC 

GGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGC 

CTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCC 

CCTCGAAGCTCGACTCTAGGGGCTCGAGATCTGCGATCTAAGTAAGCTTGGCATTCCGGTAC 

TGTTGGTAAAGCCACCATGGAAGACGCCAAAAACATAAAGAAAGGCCCGGCGCCATTCTATC 

CGCTGGAAGATGGAACCGCTGGAGAGCAACTGCATAAGGCTATGAAGAGATACGCCCTGGTT 

CCTGGAACAATTGCTTTTACAGATGCACATATCGAGGTGGACATCACTTACGCTGAGTACTT 

CGAAATGTCCGTTCGGTTGGCAGAAGCTATGAAACGATATGGGCTGAATACAAATCACAGAA 

TCGTCGTATGCAGTGAAAACTCTCTTCAATTCTTTATGCCGGTGTTGGGCGCGTTATTTATC 

GGAGTTGCAGTTGCGCCCGCGAACGACATTTATAATGAACGTGAATTGCTCAACAGTATGGG 

CATTTCGCAGCCTACCGTGGTGTTCGTTTCCAAAAAGGGGTTGCAAAAAATTTTGAACGTGC 

AAAAAAAGCTCCCAATCATCCAAAAAATTATTATCATGGATTCTAAAACGGATTACCAGGGA 

TTTCAGTCGATGTACACGTTCGTCACATCTCATCTACCTCCCGGTTTTZy^TGAATACGATTT 

TGTGCCAGAGTCCTTCGATAGGGACAAGACAATTGCACTGATCATGAACTCCTCTGGATCTA 

CTGGTCTGCCTAAAGGTGTCGCTCTGCCTCATAGAACTGCCTGCGTGAGATTCTCGCATGCC 

AGAGATCCTATTTTTGGCAATCAAATCATTCCGGATACTGCGATTTTAAGTGTTGTTCCATT 

CCATCACGGTTTTGGAATGTTTACTACACTCGGATATTTGATATGTGGATTTCGAGTCGTCT 

TAATGTATAGATTTGAAGAAGAGCTGTTTCTGAGGAGCCTTCAGGATTACAAGATTCAaAGT 

GCGCTGCTGGTGCCAACCCTATTCTCCTTCTTCGCCAAAAGCACTCTGATTGACAAATACGA 

TTTATCTAATTTACACGAAATTGCTTCTGGTGGCGCTCCCCTCTCTAAGGAAGTCGGGGAAG 

CGGTTGCCAAGAGGTTCCATCTGCCAGGTATCAGGCAAGGATATGGGCTCACTGAGACTACA 

TCAGCTATTCTGATTACACCCGAGGGGGATGATAAACCGGGCGCGGTCGGTAAAGTTGTTCC 

ATTTTTTGAAGCGAAGGTTGTGGATCTGGATACCGGGAAAACGCTGGGCGTTAATCAAAGAG 

GCGAACTGTGTGTGAGAGGTCCTATGATTATGTCCGGTTATGTAAACAATCCGGAAGCGACC 

AACGCCTTGATTGACAAGGATGGATGGCTACATTCTGGAGACATAGCTTACTGGGACGAAGA 

CGAACACTTCTTCATCGTTGACCGCCTGAAGTCTCTGATTAAGTACAAAGGCTATCAGGTGG 

CTCCCGCTGAATTGGAATCCATCTTGCTCCAACACCCCAACATCTTCGACGCAGGTGTCGCA 

GGTCTTCCCGACGATGACGCCGGTGAACTTCCCGCCGCCGTTGTTGTTTTGGAGCACGGAAA 

GACGATGACGGAAAAAGAGATCGTGGATTACGTCGCCAGTCAAGTAACAACCGCGAAAAAGT 

TGCGCGGAGGAGTTGTGTTTGTGGACGAAGTACCGAAAGGTCTTACCGGAAAACTCGACGCA 

AGAAAAATCAGAGAGATCCTCATAAAGGCCAAGAAGGGCGGAAAGATCGCCGTGTAATTCTA 

GAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGACAAA 

CCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTA 

TTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTATGTT 

TCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTGGTA 

AAATCGATAAGGATCAATTCGGCTTCGACTAGTACTGACGGACACACCGAAGCCCCGGCGGC 

AACCCTCAGCGGATGCCCCGGGGCTTCACGTTTTCCCAGGTCAGAAGCGGTTTTCGGGAGTA 

GTGCCCCAACTGGGGTAACCTTTGAGTTCTCTCAGTTGGGGGCGTAGGGTCGCCGACATGAC 

ACAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCGCGCGAGC 

GCGACTAGTACAAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCA 

GCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATC 

ATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGC 
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GCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCC 
ACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAA 
CCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACA 
AAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT 
CCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATAGCTGTC 
CGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTT 
CGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 
TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACT 
GGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCT 
TGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTG 
AAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGG 
TAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAG 
ATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATT 
TTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTT 
TAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTG 

aggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtg 
tagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgaga 
cccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgca 
gaagtggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaagctaga 
gtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggt 
gtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagtta 
catgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtcaga 
agtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctcttactgt 
catgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgagaat 
agtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacat 
agcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggat 
cttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagcat 
cttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaag 
ggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaag 
catttatcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaataaac 
aaataggggttccgcgcacatttccccgaaaagtgccacctgacgcgccctgtagcggcgca 
ttaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagc 
gcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaag 
ctctaaatcgggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaa 
aaacttgattagggtgatggttcacgtagtgggccatcgccctgatagacggtttttcgccc 
tttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaacactca 
accctatctcggtctattcttttgatttataagggattttgccgatttcggcctattggtta 
aaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgtttacaat 

TTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGC 
TATTACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGA 
GCGGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCG 
ATAGTACTAACATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGC 
TGTCCCCAGTGCAAGTGCAGGTGCCAGAACATTT 



Fig. 11 



wo 2006/093847 



15/45 



PCT/US2006/006752 



pCMV-pur-attB (SEQ ID NO: 4) 

CTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGAC 

AAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCT 

TTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTAT 

GTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTG 

GTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTCGA 

AGCCGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGTACTCCACCTCACCC 

ATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGCGCGGCG 

CACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGTGAGCACGGGACGTG 

CGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCTCGACGGTCACGGCG 

GGCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAG 

CTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCA 

TGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCG 

CTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCA 

CAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAAC 

CGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAA 

AAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTC 

CCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCC 

GCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTC 

GGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCT 

GCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTG 

GCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTT 

GAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGA 

AGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGT 

AGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGA 

TCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTT 

TGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTT 

l^TCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGA 

GGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGT 

AGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 

CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAG 

AAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAG 

TAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTG 

TCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTAC 

ATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAA 

GTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTC 

ATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATA 

GTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATA 

GCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATC 

TTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATC 

TTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGG 

GAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGC 

ATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACA 

AATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCAT 

TAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCG 

CCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGC 

TCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAA 

AACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCT 

TTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAA 

CCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAA 

AAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATT 

TCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT 

ATTACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAG 

CGGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGA 
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TAGTACTAACATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGCT 
GTCCCCAGTGCAAGTGCAGGTGCCAGAACATTTCTCTATCGATAGGTACCGAGCTCTTACGC 
GTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAATATTGGCAATTAGCCATATTAGTCA 
TTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATATCAT 
AATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTATTGAC 
TAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCG 
TTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACG 
TCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGT 
GGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGC 
CCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 
CGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCG 
GTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCC 
ACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGT 
CGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATAT 
AAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACC 
TCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGGGGCTCGAGA 
TCTGCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGTGCCGCCACCAT 
CCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGACCGAGTACAAGCCC 
ACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCACCCTCGCCGCCGCGTT 
CGCCGACTACCCCGCCACGCGCCACACCGTCGACCCGGACCGCCACATCGAGCGGGTCACCG 
AGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGACATCGGCAAGGTGTGGGTCGCGGAC 
GACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGTCGAAGCGGGGGCGGTGTTCGC 
CGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAGATGG 
AAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGCGTGGTTCCTGGCCACCGTCGGCGTC 
TCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCGTCGTGCTCCCCGGAGTGGAGGCGGC 
CGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCCGCGCCCCGCAACCTCCCCTTCTACG 
AGCGGCTCGGCTTCACCGTCACCGCCGACGTCGAGGTGCCCGAAGGACCGCGCACCTGGTGC 
ATGACCCGCAAGCCCGGTGCCTGACGCCCGCCCCACGACCCGCAGCGCCCGACCGAAAGGAG 
CGCACGACCCCATGGCTCCGACCGAAGCCGACCCGGGCGGCCCCGCCGACCCCGCACCCGCC 
CCCGAGGCCCACCGACT 



Fig. 12 



wo 2006/093847 



17/45 



PCT/US2006/006752 



pCMV-pur-attP (SEQ ID NO: 5) 

CTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGAC 
AAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCT 

TTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTAT 
GTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTG 
GTAAAATCGATAAGGATCAATTCGGCTTCGACTAGTACTGACGGACACACCGAAGCCCCGGC 
GGCAACCCTCAGCGGATGCCCCGGGGCTTCACGTTTTCCCAGGTCAGAAGCGGTTTTCGGGA 
GTAGTGCCCCAACTGGGGTAACCTTTGAGTTCTCTCAGTTGGGGGCGTAGGGTCGCCGACAT 
GACACAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCGCGCG 
AGCGCGACTAGTACAAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTC2^CCCAG 
TCAGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTT 
ATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGC 
TGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTA 
TCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAG 
GAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATC 
ACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCG 
TTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCT 
GTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCA 
GTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGAC 
CGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCC 
ACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGT 
TCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTG 
CTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGC 
TGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAG 
AAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGG 
ATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAG 
TTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCA 
GTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTC 
GTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCG 
AGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGC 
GCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCT 
AGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGT 
GGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAG 
TTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTC 
AGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTAC 
TGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAG 
AATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCA 
CATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAG 
GATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAG 
CATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAA 
AAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTG 
AAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATA 
AACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGC 
GCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCT 
AGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTC 
AAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCC 
AAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCG 
CCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACAC 
TCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGG 
TTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTAC 
AATTTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTT 
CGCTATTACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTT 
GGAGCGGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAA 
TCGATAGTACTAACATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATA 
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GGCTGTCCCCAGTGCAAGTGCAGGTGCCAGAACATTTCTCTATCGATAGGTACCGAGCTCTT 
ACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAATATTGGCAATTAGCCATATTA 
GTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATA 
TCATAATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTAT 
TGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTC 
CGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATT 
GACGTGAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAAT 
GGGTGGAGTATTTACGGTAAACTGGCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGT 
CCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGAC 
CTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGA 
TGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGT 
CTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAA 
ATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCT 
ATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTT 
GACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGGGGCTC 
GAGATCTGCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGTGCCGCCA 
CCATCCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGACCGAGTACAA 
GCCCACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCACCCTCGCCGCCG 
CGTTCGCCGACTACCCCGCCACGCGCCACACCGTCGACCCGGACCGCCACATCGAGCGGGTC 
ACCGAGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGACATCGGCAAGGTGTGGGTCGC 
GGACGACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGTCGAAGCGGGGGCGGTGT 
TCGCCGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAG 
ATGGAAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGCGTGGTTCCTGGCCACCGTCGG 
CGTCTCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCGTCGTGCTCCCCGGAGTGGAGG 
CGGCCGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCCGCGCCCCGCAACCTCCCCTTC 
TACGAGCGGCTCGGCTTCACCGTCACCGCCGACGTCGAGGTGCCCGAAGGACCGCGCACCTG 
GTGCATGACCCGCAAGCCCGGTGCCTGACGCCCGCCCCACGACCCGCAGCGCCCGACCGA2^ 
GGAGCGCACGACCCCATGGCTCCGACCGAAGCCGACCCGGGCGGCCCCGCCGACCCCGCACC 
CGCCCCCGAGGCCCACCGACT 
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pCMV-EGFP-attB (SEQ ID NO: 6) 

CTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGAC 
AAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCT 
TTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTAT 
GTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTG 
GTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTCGA 
AGCCGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGTACTCCACCTCACCC 
ATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGCGCGGCG 
CACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGTGAGCACGGGACGTG 
CGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCTCGACGGTCACGGCG 
GGCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTGAACCCAGTCAG 
CTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCA 
TGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCG 
CTCGGTCGTTCGGCTGCGGCGAGCGGJ'ATCAGCTCACTCAAAGGCGGTAATACGGTTATCCA 
CAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAAC 
CGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAA 
AAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTC 
CCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCC 
GCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTC 
GGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCT 
GCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTG 
GCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTT 
GAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGA 
AGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGT 
AGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGA 
TCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTT 
TGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTT 
AAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGA 
GGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGT 
AGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 
CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAG 
2y\GTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAG 
TAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTG 
TCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTAC 
ATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAA 
GTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTC 
ATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATA 
GTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATA 
GCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATC 
TTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATC 
TTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGG 
GAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTC2\ATATTATTGAAGC 
ATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACA 
AATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCAT 
TAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCG 
CCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGC 
TCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAA 
AACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCT 
TTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAA 
CCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAA 
AAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATT 
TCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT 
ATTACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAG 
CGGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGA 
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TAGTACTAACATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGCT 
GTCCCCAGTGCAAGTGCAGGTGCCAGAACATTTCTCTATCGATAGGTACCGAGCTCTTACGC 
GTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAATATTGGCAATTAGCCATATTAGTCA 
TTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATATCAT 
AATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTATTGAC 
TAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCG 
TTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACG 
TCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGT 
GGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGC 
CCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 
CGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCG 
GTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCC 
ACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGT 
CGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATAT 
AAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACC 
TCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGGGGCTCGAGA 
TCCCCGGGTACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGC 
CCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGC 
GAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCC 
CGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACC 
CCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAG 
CGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG 
CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCC 
TGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG 
AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCT 
CGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC 
ACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTC 
CTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAG 
CGGCCGCTCGAGCATGCAT 
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p-12.0-lys-LSPIFNMM-CMV-pur-attB (SEQ ID NO: 7) 

GGGCTGCAGGAATTCGATTGCCGCCTTCTTTGATATTCACTCTGTTGTATTTCATCTCTTCT 
TGCCGATGAAAGGATATAACAGTCTGTATAACAGTCTGTGAGGAAATACTTGGTATTTCTTC 
TGATCAGTGTTTTTATAAGTAATGTTGAATATTGGATAAGGCTGTGTGTCCTTTGTCTTGGG 
AGACAAAGCCCACAGCAGGTGGTGGTTGGGGTGGTGGCAGCTCAGTGACAGGAGAGGTTTTT 

TTGCCTGTTTTTTTTTTTTTTTTTTTTTTTAAGTAAGGTGTTCTTTTTTCTTAGTAAATTTT 
CTACTGGACTGTATGTTTTGACAGGTCAGAAACATTTCTTCAAAAGAAGAACCTTTTGGAAA 
CTGTACAGCCCTTTTCTTTCATTCCCTTTTTGCTTTCTGTGCCAATGCCTTTGGTTCTGATT 
GCATTATGGAAAACGTTGATGGGAACTTGAGGTTTTTATTTATAGTGTGGCTTGAAAGCTTG 
GATAGCTGTTGTTACACGAGATACCTTATTAAGTTTAGGCCAGCTTGATGCTTTATTTTTTC 
CCTTTGAAGTAGTGAGCGTTCTCTGGTTTTTTTCCTTTGAAACTGGTGAGGCTTAGATTTTT 
CTAATGGGATTTTTTACCTGATGATCTAGTTGCATACCCAAATGCTTGTAAATGTTTTCCTA 
GTTAACATGTTGATAACTTCGGATTTACATGTTGTATATACTTGTCATCTGTGTTTCTAGTA 
AAAATATATGGCATTTATAGAAATACGTAATTCCTGATTTCCTTTTTTTTTATCTCTATGCT 
CTGTGTGTACAGGTCAAACAGACTTCACTCCTATTTTTATTTATAGAATTTTATATGCAGTC 
TGTCGTTGGTTCTTGTGTTGTAAGGATACAGCCTTAAATTTCCTAGAGCGATGCTCAGTAAG 
GCGGGTTGTCACATGGGTTCAAATGTAAAACGGGCACGTTTGGCTGCTGCCTTCCCGAGATC 
CAGGACACTAAACTGCTTCTGCACTGAGGTATAAATCGCTTCAGATCCCAGGGAAGTGCAGA 
TCCACGTGCATATTCTTAAAGAAGAATGAATACTTTCTAAAATATTTTGGCATAGGAAGCAA 
GCTGCATGGATTTGTTTGGGACTTAAATTATTTTGGTAACGGAGTGCATAGGTTTTAAACAC 
AGTTGCAGCATGCTAACGAGTCACAGCGTTTATGCAGAAGTGATGCCTGGATGCCTGTTGCA 
GCTGTTTACGGCACTGCCTTGCAGTGAGCATTGCAGATAGGGGTGGGGTGCTTTGTGTCGTG 
TTCCCACACGCTGCCACACAGCCACCTCCCGGAACACATCTCACCTGCTGGGTACTTTTCAA 
ACCATCTTAGCAGTAGTAGATGAGTTACTATGAAACAGAGAAGTTCGTCAGTTGGATATTCT 
CATGGGATGTCTTTTTTCCCATGTTGGGCAAAGTATGATAAAGCATCTCTATTTGTAAATTA 
TGCACTTGTTAGTTCCTGAATCCTTTCTATAGCACCACTTATTGCAGCAGGTGTAGGCTCTG 
GTGTGGCCTGTGTCTGTGCTTCAATCTTTTAAAGCTTCTTTGGAAATACACTGACTTGATTG 
AAGTCTCTTGAAGATAGTAAACAGTACTTACCTTTGATCCCAATGAAATCGAGCATTTCAGT 
TGTAAAAGAATTCCGCCTATTCATACCATGTAATGTAATTTTACACCCCCAGTGCTGACACT 
TTGGAATATATTCAAGTAATAGACTTTGGCCTCACCCTCTTGTGTACTGTATTTTGTAATAG 
AAAATATTTTAAACTGTGCATATGATTATTACATTATGAAAGAGACATTCTGCTGATCTTCA 
AATGTAAGAAAATGAGGAGTGCGTGTGCTTTTATAAATACAAGTGATTGCAAATTAGTGCAG 
GTGTCCTTAAAAAAAAAAAAAAAAAGTAATATAAAAAGGACCAGGTGTTTTACAAGTGAAAT 
ACATTCCTATTTGGTAAACAGTTACATTTTTATGAAGATTACCAGCGCTGCTGACTTTCTAA 
ACATAAGGCTGTATTGTCTTCCTGTACCATTGCATTTCCTCATTCCCAATTTGCACAAGGAT 
GTCTGGGTAAACTATTCAAGAAATGGCTTTGAAATACAGCATGGGAGCTTGTCTGAGTTGGA 
ATGCAGAGTTGCACTGCAAAATGTCAGGAAATGGATGTCTCTCAGAATGCCCAACTCCAAAG 
GATTTTATATGTGTATATAGTAAGCAGTTTCCTGATTCCAGCAGGCCAAAGAGTCTGCTGAA 
TGTTGTGTTGCCGGAGACCTGTATTTCTCAACAAGGTAAGATGGTATCCTAGCAACTGCGGA 
TTTTAATACATTTTCAGCAGAAGTACTTAGTTAATCTCTACCTTTAGGGATCGTTTCATCAT 
TTTTAGATGTTATACTTGAAATACTGCATAACTTTTAGCTTTCATGGGTTCCTTTTTTTCAG 
CCTTTAGGAGACTGTTAAGCAATTTGCTGTCCAACTTTTGTGTTGGTCTTAAACTGCAATAG 
TAGTTTACCTTGTATTGAAGAAATAAAGACCATTTTTATATTAAAAAATACTTTTGTCTGTC 
TTCATTTTGACTTGTCTGATATCCTTGCAGTGCCCATTATGTCAGTTCTGTCAGATATTCAG 
ACATCAAAACTTAACGTGAGCTCAGTGGAGTTACAGCTGCGGTTTTGATGCTGTTATTATTT 
CTGAAACTAGAAATGATGTTGTCTTCATCTGCTCATCAAACACTTCATGCAGAGTGTAAGGC 
TAGTGAGAAATGCATACATTTATTGATACTTTTTTAAAGTCAACTTTTTATCAGATTTTTTT 
TTCATTTGGAAATATATTGTTTTCTAGACTGCATAGCTTCTGAATCTGAAATGCAGTCTGAT 
TGGCATGAAGAAGCACAGCACTCTTCATCTTACTTAAACTTCATTTTGGAATGAAGGAAGTT 
AAGCAAGGGCACAGGTCCATGAAATAGAGACAGTGCGCTCAGGAGAAAGTGAACCTGGATTT 
CTTTGGCTAGTGTTCTAAATCTGTAGTGAGGAAAGTAACACCCGATTCCTTGAAAGGGCTCC 
AGCTTTAATGCTTCCAAATTGAAGGTGGCAGGCAACTTGGCCACTGGTTATTTACTGCATTA 
TGTCTCAGTTTCGCAGCTAACCTGGCTTCTCCACTATTGAGCATGGACTATAGCCTGGCTTC 
AGAGGCCAGGTGAAGGTTGGGATGGGTGGAAGGAGTGCTGGGCTGTGGCTGGGGGGACTGTG 
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GGGACTCCAAGCTGAGCTTGGGGTGGGCAGCACAGGGAAAAGTGTGGGTAACTATTTTTAAG 
TACTGTGTTGCAAACGTCTCATCTGCAAATACGTAGGGTGTGTACTCTCGAAGATTAACAGT 
GTGGGTTCAGTAATATATGGATGAATTCACAGTGGAAGCATTCAAGGGTAGATCATCTAACG 
ACACCAGATCATCAAGCTATGATTGGAAGCGGTATCAGAAGAGCGAGGAAGGTAAGCAGTCT 
TCATATGTTTTCCCTCCACGTAAAGCAGTCTGGGAAAGTAGCACCCCTTGAGCAGAGACAAG 
GAAATAATTCAGGAGCATGTGCTAGGAGAACTTTCTTGCTGAATTCTACTTGCAAGAGCTTT 
GATGCCTGGCTTCTGGTGCCTTCTGCAGCACCTGCAAGGCCCAGAGCCTGTGGTGAGCTGGA 
GGGAAAGATTCTGCTCAAGTCCAAGCTTCAGCAGGTCATTGTCTTTGCTTCTTCCCCCAGCA 
CTGTGCAGCAGAGTGGAACTGATGTGGAAGCGTCCTGTCCACTACCTGTTGCTGCAGGCAGA 
CTGCTCTCAGAAAAAGAGAGCTAACTCTATGCCATAGTCTGAAGGTAAAATGGGTTTTAAAA 
AAGAAAACACAAAGGCAAAACCGGCTGCCCCATGAGAAGAAAGCAGTGGTAAACATGGTAGA 
AAAGGTGCAGAAGCCCCCAGGCAGTGTGACAGGCCCCTCCTGCCACCTAGAGGCGGGAACAA 
GCTTCCCTGCCTAGGGCTCTGCCCGCGAAGTGCGTGTTTCTTTGGTGGGTTTTGTTTGGCGT 
TTGGTTTTGAGATTTAGACACAAGGGAAGCCTGAAAGGAGGTGTTGGGCACTATTTTGGTTT 
GTAAAGCCTGTACTTCAAATATATATTTTGTGAGGGAGTGTAGCGAATTGGCCAATTTAAAA 
TAAAGTTGCAAGAGATTGAAGGCTGAGTAGTTGAGAGGGTAACACGTTTAATGAGATCTTCT 
GAAACTACTGCTTCTAAACACTTGTTTGAGTGGTGAGACCTTGGATAGGTGAGTGCTCTTGT 
TACATGTCTGATGCACTTGCTTGTCCTTTTCCATCCACATCCATGCATTCCACATCCACGCA 
TTTGTCACTTATCCCATATCTGTCATATCTGACATACCTGTCTCTTCGTCACTTGGTCAGAA 
GAAACAGATGTGATAATCCCCAGCCGCCCCAAGTTTGAGAAGATGGCAGTTGCTTCTTTCCC 
TTTTTCCTGCTAAGTAAGGATTTTCTCCTGGCTTTGACACCTCACGAAATAGTCTTCCTGCC 
TTACATTCTGGGCATTATTTCAAATATCTTTGGAGTGCGCTGCTCTCAAGTTTGTGTCTTCC 
TACTCTTAGAGTGAATGCTCTTAGAGTGAAAGAGAAGGAAGAGAAGATGTTGGCCGCAGTTC 
TCTGATGAACACACCTCTGAATAATGGCCAAAGGTGGGTGGGTTTCTCTGAGGAACGGGCAG 
CGTTTGCCTCTGAAAGCAAGGAGCTCTGCGGAGTTGCAGTTATTTTGCAACTGATGGTGGAA 
CTGGTGCTTAAAGCAGATTCCCTAGGTTCCCTGCTACTTCTTTTCCTTCTTGGCAGTCAGTT 
TATTTCTGACAGACAAACAGCCACCCCCACTGCAGGCTTAGAAAGTATGTGGCTCTGCCTGG 
GTGTGTTACAGCTCTGCCCTGGTGAAAGGGGATTAAAACGGGCACCATTCATCCCAAACAGG 
ATCCTCATTCATGGATCAAGCTGTAAGGAACTTGGGCTCCAACCTCAAAACATTAATTGGAG 
TACGAATGTAATTAAAACTGCATTCTCGCATTCCTAAGTCATTTAGTCTGGACTCTGCAGCA 
TGTAGGTCGGCAGCTCCCACTTTCTCAAAGACCACTGATGGAGGAGTAGTAAAAATGGAGAC 
CGATTCAGAACAACCAACGGAGTGTTGCCGAAGAAACTGATGGAAATAATGCATGAATTGTG 
TGGTGGACATTTTTTTTAAATACATAAACTACTTCAAATGAGGTCGGAGAAGGTCAGTGTTT 
TATTAGCAGCCATAAAACCAGGTGAGCGAGTACCATTTTTCTCTACAAGAAAAACGATTCTG 
AGCTCTGCGTAAGTATAAGTTCTCCATAGCGGCTGAAGCTCCCCCCTGGCTGCCTGCCATCT 
CAGCTGGAGTGCAGTGCCATTTCCTTGGGGTTTCTCTCACAGCAGTAATGGGACAATACTTC 
ACAAAAATTCTTTCTTTTCCTGTCATGTGGGATCCCTACTGTGCCCTCCTGGTTTTACGTTA 
CCCCCTGACTGTTCCATTCAGCGGTTTGGAAAGAGAAAAAGAATTTGGAAATAAAACATGTC 
TACGTTATCACCTCCTCCAGCATTTTGGTTTTTAATTATGTCAATAACTGGCTTAGATTTGG 
AAATGAGAGGGGGTTGGGTGTATTACCGAGGAACAAAGGAAGGCTTATATAAACTCAAGTCT 
TTTATTTAGAGAACTGGCAAGCTGTCAAAAACAAAAAGGCCTTACCACCAAATTAAGTGAAT 
AGCCGCTATAGCCAGCAGGGCCAGCACGAGGGATGGTGCACTGCTGGCACTATGCCACGGCC 
TGCTTGTGACTCTGAGAGCAACTGCTTTGGAAATGACAGCACTTGGTGCAATTTCCTTTGTT 
TCAGAATGCGTAGAGCGTGTGCTTGGCGACAGTTTTTCTAGTTAGGCCACTTCTTTTTTCCT 
TCTCTCCTCATTCTCCTAAGCATGTCTCCATGCTGGTAATCCCAGTCAAGTGAACGTTCAAA 
CAATGAATCCATCACTGTAGGATTCTCGTGGTGATCAAATCTTTGTGTGAGGTCTATAAAAT 
ATGGAAGCTTATTTATTTTTCGTTCTTCCATATCAGTCTTCTCTATGACAATTCACATCCAC 
CACAGCAAATTAAAGGTGAAGGAGGCTGGTGGGATGAAGAGGGTCTTCTAGCTTTACGTTCT 
TCCTTGCAAGGCCACAGGAAAATGCTGAGAGCTGTAGAATACAGCCTGGGGTAAGAAGTTCA 
GTCTCCTGCTGGGACAGCTAACCGCATCTTATAACCCCTTCTGAGACTCATCTTAGGACCAA 
ATAGGGTSTATCTGGGGTTTTTGTTCCTGCTGTTCCTCCTGGAAGGCTATCTCACTATTTCA 
CTGCTCCCACGGTTACAAACCAAAGATACAGCCTGAATTTTTTCTAGGCCACATTACATAAA 
TTTGACCTGGTACCAATATTGTTCTCTATATAGTTATTTCCTTCCCCACTGTGTTTAACCCC 
TTAAGGCATTCAGAACAACTAGAATCATAGAATGGTTTGGATTGGAAGGGGCCTTAAACATC 
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ATCCATTTCCAACCCTCTGCCATGGGCTGCTTGCCACCCACTGGCTCAGGCTGCCCAGGGCC 
CCATCCAGCCTGGCCTTGAGCACCTCCAGGGATGGGGCACCCACAGCTTCTCTGGGCAGCCT 
GTGCCAACACCTCACCACTCTCTGGGTAAAGAATTCTCTTTTAACATCTAATCTAAATCTCT 
TCTCTTTTAGTTTAAAGCCATTCCTCTTTTTCCCGTTGCTATCTGTCCAAGAAATGTGTATT 
GGTCTCCCTCCTGCTTATAAGCAGGAAGTACTGGAAGGCTGCAGTGAGGTCTCCCCACAGCC 
TTCTCTTCTCCAGGCTGAACAAGCCCAGCTCCTTCAGCCTGTCTTCGTAGGAGATCATCTTA 
GTGGCCCTCCTCTGGACCCATTCCAACAGTTCCACGGCTTTCTTGTGGAGCCCCAGGTCTGG 
ATGCAGTACTTCAGATGGGGCCTTACAAAGGCAGAGCAGATGGGGACAATCGCTTACCCCTC 
CCTGCTGGCTGCCCCTGTTTTGATGCAGCCCAGGGTACTGTTGGCCTTTCAGGCTCCCAGAC 
CCCTTGCTGATTTGTGTCAAGCTTTTCATCCACCAGAACCCACGCTTCCTGGTTAATACTTC 
TGCCCTCACTTCTGTAAGCTTGTTTCAGGAGACTTCCATTCTTTAGGACAGACTGTGTTACA 
CCTACCTGCCCTATTCTTGCATATATACATTTCAGTTCATGTTTCCTGTAACAGGACAGAAT 
ATGTATTCCTCTAACAAAAATACATGCAGAATTGCTAGTGCCATCTCAGTAGGGTTTTCATQ 
GCAGTATTAGCACATAGTCAATTTGCTGCAAGTACCTTCCAAGCTGCGGCCTCCCATAAATC 
CTGTATTTGGGATCAGTTACCTTTTGGGGTAAGCTTTTGTATCTGCAGAGACCCTGGGGGTT 
CTGATGTGCTTCAGCTCTGCTCTGTTCTGACTGCACCATTTTCTAGATCACCCAGTTGTTCC 
TGTACAACTTCCTTGTCCTCCATCCTTTCCCAGCTTGTATCTTTGACAAATACAGGCCTATT 
TTTGTGTTTGCTTCAGCAGCCATTTAATTCTTCAGTGTCATCTTGTTCTGTTGATGCCACTG 
GAACAGGATTTTCAGCAGTCTTGCAAAGAACATCTAGCTGAAAACTTTCTGCCATTCAATAT 
TCTTACCAGTTCTTCTTGTTTGAGGTGAGCCATAAATTACTAGAACTTCGTCACTGACAAGT 
TTATGCATTTTATTACTTCTATTATGTACTTACTTTGACATAACACAGACACGCACATATTT 
TGCTGGGATTTCCACAGTGTCTCTGTGTCCTTCACATGGTTTTACTGTCATACTTCCGTTAT 
AACCTTGGCAATCTGCCCAGCTGCCCATCACAAGAAAAGAGATTCCTTTTTTATTACTTCTC 
TTCAGCCAATAAACAAAATGTGAGAAGCCCAAACAAGAACTTGTGGGGCAGGCTGCCATCAA 
GGGAGAGACAGCTGAAGGGTTGTGTAGCTCAATAGAATTAAGAAATAATAAAGCTGTGTCAG 
ACAGTTTTGCCTGATTTATACAGGCACGCCCCAAGCCAGAGAGGCTGTCTGCCAAGGCCACC 
TTGCAGTCCTTGGTTTGTAAGATAAGTCATAGGTAACTTTTCTGGTGAATTGCGTGGAGAAT 
CATGATGGCAGTTCTTGCTGTTTACTATGGTAAGATGCTAAAATAGGAGACAGCAAAGTAAC 
ACTTGCTGCTGTAGGTGCTCTGCTATCCAGACAGCGATGGCACTCGCACACCAAGATGAGGG 
ATGCTCCCAGCTGACGGATGCTGGGGCAGTAACAGTGGGTCCCATGCTGCCTGCTCATTAGC 
ATCACCTCAGCCCTCACCAGCCCATCAGAAGGATCATCCCAAGCTGAGGAAAGTTGCTCATC 
TTCTTCACATCATCAAACCTTTGGCCTGACTGATGCCTCCCGGATGCTTAAATGTGGTCACT 
GACATCTTTATTTTTCTATGATTTCAAGTCAGAACCTCCGGATCAGGAGGGAACACATAGTG 
GGAATGTACCCTCAGCTCCAAGGCCAGATCTTCCTTCAATGATCATGCATGCTACTTAGGAA 
GGTGTGTGTGTGTGAATGTAGAATTGCCTTTGTTATTTTTTCTTCCTGCTGTCAGGAACATT 
TTGAATACCAGAGAAAAAGAAAAGTGCTCTTCTTGGCATGGGAGGAGTTGTCACACTTGCAA 
AATAAAGGATGCAGTCCCAAATGTTCATAATCTCAGGGTCTGAAGGAGGATCAGAAACTGTG 
TATACAATTTCAGGCTTCTCTGAATGCAGCTTTTGAAAGCTGTTCCTGGCCGAGGCAGTACT 
AGTCAG2\ACCCTCGGAAACAGGAACAAATGTCTTCAAGGTGCAGCAGGAGGAAACACCTTGC 
CCATCATGAAAGTGAATAACCACTGCCGCTGAAGGAATCCAGCTCCTGTTTGAGCAGGTGCT 
GCACACTCCCACACTGAAACAACAGTTCATTTTTATAGGACTTCCAGGAAGGATCTTCTTCT 
TAAGCTTCTTAATTATGGTACATCTCCAGTTGGCAGATGACTATGACTACTGACAGGAGAAT 
GAGGAACTAGCTGGGAATATTTCTGTTTGACCACCATGGAGTCACCCATTTCTTTACTGGTA 
T T T GGAAAT AAT AAT T C T G AAT T G C AAA GO AGGAGT T AGO GA AG AT CTTCATTTCTTCCATG 
TTGGTGACAGCACAGTTCTGGCTATGAAAGTCTGCTTACAAGGAAGAGGATAAAAATCATAG 
GGATAATAAATCTAAGTTTGAAGACAATGAGGTTTTAGCTGCATTTGACATGAAGAAATTGA 
GACCTCTACTGGATAGCTATGGTATTTACGTGTCTTTTTGCTTAGTTACTTATTGACCCCAG 
CTGAGGTCAAGTATGAACTCAGGTCTCTCGGGCTACTGGCATGGATTGATTACATACAACTG 
TAATTTTAGCAGTGATTTAGGGTTTATGAGTACTTTTGCAGTAAATCATAGGGTTAGTAATG 
TTAATCTCAGGGAAAAAAAAAAAAAGCCAACCCTGACAGACATCCCAGCTCAGGTGGAAATC 
AAGGATCACAGCTCAGTGCGGTCCCAGAGAACACAGGGACTCTTCTCTTAGGACCTTTATGT 
ACAGGGCCTCAAGATAACTGATGTTAGTCAGAAGACTTTCCATTCTGGCCACAGTTCAGCTG 
AGGCAATCCTGGAATTTTCTCTCCGCTGCACAGTTCCAGTCATCCCAGTTTGTACAGTTCTG 
GCACTTTTTGGGTCAGGCCGTGATCCAAGGAGCAGAAGTTCCAGCTATGGTCAGGGAGTGCC 
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TGACCGTCCCAACTCACTGCACTCAAACAAAGGCGAAACCACAAGAGTGGCTTTTGTTGAT^ 
TTGCAGTGTGGCCCAGAGGGGCTGCACCAGTACTGGATTGACCACGAGGCAACATTAATCCT 
CAGCAAGTGCAATTTGCAGCCATTAAATTGAACTAACTGATACTACAATGCAATCAGTATCA 
ACAAGTGGTTTGGCTTGGAAGATGGAGTCTAGGGGCTCTACAGGAGTAGCTACTCTCTAATG 
GAGTTGCATTTTGAAGCAGGACACTGTGAAAAGCTGGCCTCCTAAAGAGGCTGCTAAACATT 
AGGGTCAATTTTCCAGTGCACTTTCTGAAGTGTCTGCAGTTCCCCATGCAAAGCTGCCCAAA 
CATAGCACTTCCAATTGAATACAATTATATGCAGGCGTACTGCTTCTTGCCAGCACTGTCCT 
TCTCAAATGAACTCAACAAACAATTTCAAAGTCTAGTAGAAAGTAACAAGCTTTGAATGTCA 
TTAAAAAGTATATCTGCTTTCAGTAGTTCAGCTTATTTATGCCCACTAGAAACATCTTGTAC 
AAGCTGAACACTGGGGCTCCAGATTAGTGGTAAAACCTACTTTATACAATCATAGAATCATA 
GAATGGCCTGGGTTGGAAGGGACCCCAAGGATCATGAAGATCCAACACCCCCGCCACAGGCA 
GGGCCACCAACCTCCAGATCTGGTACTAGACCAGGCAGCCCAGGGCTCCATCCAACCTGGCC 
ATGAACACCTCCAGGGATGGAGCATCCACAACCTCTCTGGGCAGCCTGTGCCAGCACCTCAC 
CACCCTCTCTGTGAAGAACTTTTCCCTGACATCCAATCTAAGCCTTCCCTCCTTGAGGTTAG 
ATCCACTCCCCCTTGTGCTATCACTGTCTACTCTTGTAAAAAGTTGATTCTCCTCCTTTTTG 
GAAGGTTGCAATGAGGTCTCCTTGCAGCCTTCTTCTCTTCTGCAGGATGAACAAGCCCAGCT 
CCCTCAGCCTGTCTTTATAGGAGAGGTGCTCCAGCCCTCTGATCATCTTTGTGGCCCTCCTC 
TGGACCCGCTCCAAGAGCTCCACATCTTTCCTGTACTGGGGGCCCCAGGCCTGAATGCAGTA 
CTCCAGATGGGGCCTCAAAAGAGCAGAGTAAAGAGGGACAATCACCTTCCTCACCCTGCTGG 
CCAGCCCTCTTCTGATGGAGCCCTGGATACAACTGGCTTTCTGAGCTGCAACTTCTCCTTAT 
CAGTTCCACTATTAAAACAGGAACAATACAACAGGTGCTGATGGCCAGTGCAGAGTTTTTCA 
CACTTCTTCATTTCGGTAGATCTTAGATGAGGAACGTTGAAGTTGTGCTTCTGCGTGTGCTT 
CTTCCTCCTCAAATACTCCTGCCTGATACCTCACCCCACCTGCCACTGAATGGCTCCATGGC 
CCCCTGCAGCCAGGGCCCTGATGAACCCGGCACTGCTTCAGATGCTGTTTAATAGCACAGTA 
TGACCAAGTTGCACCTATGAATACACAAACAATGTGTTGCATCCTTCAGCACTTGAGAAGAA 
GAGCCAZVATTTGCATTGTCAGGAAATGGTTTAGTAATTCTGCCAATTAAAACTTGTTTATCT 
ACCATGGCTGTTTTTATGGCTGTTAGTAGTGGTACACTGATGATGAACAATGGCTATGCAGT 
AAAATCAAGACTGTAGATATTGCAACAGACTATAAAATTCCTCTGTGGCTTAGCCAATGTGG 
TACTTCCCACATTGTATAAGAAATTTGGCAAGTTTAGAGCAATGTTTGAAGTGTTGGGAAAT 
TTCTGTATACTCAAGAGGGCGTTTTTGACAACTGTAGAACAGAGGAATCAAAAGGGGGTGGG 
AGGAAGTTAAAAGAAGAGGCAGGTGCAAGAGAGCTTGCAGTCCCGCTGTGTGTACGACACTG 
GCAACATGAGGTCTTTGCTAATCTTGGTGCTTTGCTTCCTGCCCCTGGCTGCCTTAGGGTGC 
GATCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACCCTGATGCTGCTGGCTCAGATGAG 
GAGAATCAGCCTGTTTAGCTGCCTGAAGGATAGGCACGATTTTGGCTTTCCTCAAGAGGAGT 
TTGGCAACCAGTTTCAGAAGGCTGAGACCATCCCTGTGCTGCACGAGATGATCCAGCAGATC 
TTTAACCTGTTTAGCACCAAGGATAGCAGCGCTGCTTGGGATGAGACCCTGCTGGATAAGTT 
TTACACCGAGCTGTACCAGCAGCTGAACGATCTGGAGGCTTGCGTGATCCAGGGCGTGGGCG 
TGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGGCTGTGAGGAAGTACTTTCAGAGG 
ATCACCCTGTACCTGAAGGAGAAGAAGTACAGCCCCTGCGCTTGGGAAGTCGTGAGGGCTGA 
GATCATGAGGAGCTTTAGCCTGAGCACCAACCTGCAAGAGAGCTTGAGGTCTAAGGAGTAAA 
AAGTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTT 
GGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTAT 
TGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATT 
TTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAA 
TGTGGTAAAATCGATAAGGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAGCT 
CCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATG 
CAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCT 
CGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACA 
GAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCG 
TAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAA 
ATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCC 
CCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGC 
CTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGG 
TGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGC 
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GCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGC 

AGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGA 

AGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAG 

CCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAG 

CGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATC 

CTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTG 

GTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAA 

ATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGG 

CACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG 

ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCC 

ACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAA 

GTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTA 

AGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTC 

ACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACAT 

GATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGT 

AAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCAT 

GCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGT 

GTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGC 

AGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTT 

ACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTT 

TTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGA 

ATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCAT 

TTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAA 

TAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCATTA 

AGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCC 

CGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTC 

TAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAA 

CTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTT 

GACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACC 

CTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAA 

AATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTC 

CCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTAT 

TACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAGCG 

GCCGCTCTAGAACTAGTGGATCCCCCGGCCGCAATAAAATATCTTTATTTTCATTACATCTG 

TGTGTTGGTTTTTTGTGTGAATCGATAGTACTAACATACGCTCTCCATCAAAACAAAACGAA 

AC AAAACA AACT AGC AAAAT AGGCT GT CCCCAGT GC AAGT GO AGGT GCCAGAAC AT TTCT CT 

ATCGATAGGTACCGAGCTCTTACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAA 

TATTGGCAATTAGCCATATTAGTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCC 

ATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATATGAC 

CGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTT 

CATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACC 

GCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAG 

GGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACAT 

CAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTG 

GCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAG 

TCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTT 

GACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCA 

AAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTA 

GGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGG 

AGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTC 

GAAGCTCGACTCTAGGGGCTCGAGATCTGCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGG 

CCGCCACGACCGGTGCCGCCACCATCCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACG 

ACCTTCCATGACCGAGTACAAGCCCACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGG 

CCGTACGCACCCTCGCCGCCGCGTTCGCCGACTACCCCGCCACGCGCCACACCGTCGACCCG 
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GACCGCCACATCGAGCGGGTCACCGAGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGA 
CATCGGCAAGGTGTGGGTCGCGGACGACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGA 
GCGTCGAAGCGGGGGCGGTGTTCGCCGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCC 
CGGCTGGCCGCGCAGCAACAGATGGAAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGC 
GTGGTTCCTGGCCACCGTCGGCGTCTCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCG 
TCGTGCTCCCCGGAGTGGAGGCGGCCGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCC 
GCGCCCCGCAACCTCCCCTTCTACGAGCGGCTCGGCTTCACCGTCACCGCCGACGTCGAGGT 
GCCCGAAGGACCGCGCACCTGGTGCATGACCCGCAAGCCCGGTGCCTGACGCCCGCCCCACG 
ACCCGCAGCGCCCGACCGAAAGGAGCGCACGACCCCATGGCTCCGACCGAAGCCGACCCGGG 
CGGCCCCGCCGACCCCGCACCCGCCCCCGAGGCCCACCGACTCTAGAGTCGGGGCGGCCGGC 
CGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAG 
TGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAG 
CTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGG 
TGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTGGTAAAATCGATAAGGATCAA 
TTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTCGAAGCCGCGGTGCGGGTGCCAG 
GGCGTGCCCTTGGGCTCCCCGGGGGCGTACTCCACCTCACCCATCTGGTCCATCATGATGAA 
CGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGCGCGGCGCACCGGGAAGCCCTCGCCCT 
CGAAACCGCTGGGCGCGGTGGTCACGGTGAGCACGGGACGTGCGACGGCGTCGGCGGGTGCG 
GATACGCGGGGCAGCGTCAGCGGGTTCTCGACGGTCACGGCGGGCATGTCGACAGCCGAATT 
GATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAGCTCCTTCCGGTGGGCGCGGG 
GCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGCAACTCGTAGGACAGGTG 
CCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGC 
GAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCA 
GGAAAGAACATG 
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pOM IFN-lns-CMV-pur-attB (SEQ ID NO: 8) 

GGCCGCCACCGCGGTGGAGCTCCAATTCGCCCTATAGTGAGTCGTATTACAATTCACTGGCC 
GTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCC2\ACTTAATCGCCTTGCAGC 
ACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAAC 
AGTTGCGCAGCCTGAATGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGT 

GTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGC 
TTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGC 
TCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGT 
GATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTC 
CACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCT 
ATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATT 
TAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTAGGTGGCACTTTT 
CGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCC 
GCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTA 
TTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCT 
CACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTA 
CATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTC 
CAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGG 
CAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGT 
CACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCA 
TGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACC 
GCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAA 
TGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGC 
GCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATG 
GAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGC 
TGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATG 
GTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGA 
AATAGACAGATCGCTGAGATAGGTGCC'i'CACTGATTAAGCATTGGTAACTGTCAGACCAAGT 
TTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGA 
AGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCG 
TCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTG 
CTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTAC 
CAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTA 
GTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCT 
GCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACT 
CAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAG 
CCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAG 
CGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAG 
GAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTT 
CGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAA 
AAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGT 
TCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGAT 
ACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCG 
CCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACA 
GGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCAT 
TAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGG 
ATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGAAATTAACCCTC 
ACTAAAGGGAACAAAAGCTGGGTACCGGGCCCCCCCTCGACTAGAGGGACAGCCCCCCCCCA 
AAGCCCCCAGGGATGTAATTACGTCCCTCCCCCGCTAGGGGGCAGCAGCGAGCCGCCCGGGG 
CTCCGCTCCGGTCCGGCGCTCCCCCCGCATCCCCGAGCCGGCAGCGTGCGGGGACAGCCCGG 
GCACGGGGAAGGTGGCACGGGATCGCTTTCCTCTGAACGCTTCTCGCTGCTCTTTGAGCCTG 
CAGACACCTGGGGGGATACGGGGAAAAAGCTTTAGGCTGAAAGAGAGATTTAGAATGACAGA 
ATCATAGAACGGCCTGGGTTGCAAAGGAGCACAGTGCTCATCCAGATCCAACCCCCTGCTAT 
GTGCAGGGTCATCAACCAGCAGCCCAGGCTGCCCAGAGCCACATCCAGCCTGGCCTTGAATG 
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CCTGCAGGGATGGGGCATCCACAGCCTCCTTGGGCAACCTGTTCAGTGCGTCACCACCCTCT 

GGGGGAAAAACTGCCTCCTCATATCCAACCCAAACCTCCCCTGTCTCAGTGTAAAGCCATTC 

CCCCTTGTCCTATCAAGGGGGAGTTTGCTGTGACATTGTTGGTCTGGGGTGACACATGTTTG 

CCAATTCAGTGCATCACGGAGAGGCAGATCTTGGGGATAAGGAAGTGCAGGACAGCATGGAC 

GTGGGACATGCAGGTGTTGAGGGCTCTGGGACACTCTCCAAGTCACAGCGTTCAGAACAGCC 

TTAAGGATAAGAAGATAGGATAGAAGGACAAAGAGCAAGTTAAAACCCAGCATGGAGAGGAG 

CACAAAAAGGCCACAGACACTGCTGGTCCCTGTGTCTGAGCCTGCATGTTTGATGGTGTCTG 

GATGCAAGCAGAAGGGGTGGAAGAGCTTGCCTGGAGAGATACAGCTGGGTCAGTAGGACTGG 

GACAGGCAGCTGGAGAATTGCCATGTAGATGTTCATACAATCGTCAAATCATGAAGGCTGGA 

AAAGCCCTCCAAGATCCCCAAGACCAACCCCAACCCACCCACCGTGCCCACTGGCCATGTCC 

CTCAGTGCCACATCCCCACAGTTCTTCATCACCTCCAGGGACGGTGACCGCCCCACCTCCGT 

GGGCAGCTGTGCCACTGCAGCACCGCTCTTTGGAGAAGGTAAATCTTGCTAAATCCAGCCCG 

ACCCTCCCCTGGCACAACGTAAGGCCATTATCTCTCATCCAACTCCAGGACGGAGTCAGTGA 

GGATGGGGCTCTAGTCGAGGTCGACGGTATCGATAAGCTTGATTAGGCAGAGCAATAGGACT 

CTCAACCTCGTGAGTATGGCAGCATGTTAACTCTGCACTGGAGTCCAGCGTGGGAAACAATC 

TGCCTTGCACATGAGTCTTCGTGGGCCAATATTCCCCAACGGTTTTCCTTCAGCTTGTCTTG 

TCTCCTAAGCTCTCAAAACACCTTTTTGGTGAATAAACTCACTTGGCAACGTTTATCTGTCT 

TACCTTAGTGTCACGTTTCATCCCTATTCCCCTTTCTCCTCCTCCGTGTGGTACACAGTGGT 

GCACACTGGTTCTTCTGTTGATGTTCTGCTCTGACAGCCAATGTGGGTAAAGTTCTTCCTGC 

CACGTGTCTGTGTTGTTTTCACTTCAAAAAGGGCCCTGGGCTCCCCTTGGAGCTCTCAGGCA 

TTTCCTTAATCATCACAGTCACGCTGGCAGGATTAGTCCCTCCTAAACCTTAGAATGACCTG • 

AACGTGTGCTCCCTCTTTGTAGTCAGTGCAGGGAGACGTTTGCCTCAAGATCAGGGTCCATC 

TCACCCACAGGGCCATTCCCAAGATGAGGTGGATGGTTTACTCTCACAAAAAGTTTTCTTAT 

GTTTGGCTAGAAAGGAGAACTCACTGCCTACCTGTGAATTCCCCTAGTCCTGGTTCTGCTGC 

CACTGCTGCCTGTGCAGCCTGTCCCATGGAGGGGGCAGCAACTGCTGTCACAAAGGTGATCC 

CACCCTGTCTCCACTGAAATGACCTCAGTGCCACGTGTTGTATAGGGTATAAAGTACGGGAG 

GGGGATGCCCGGCTCCCTTCAGGGTTGCAGAGCAGAAGTGTCTGTGTATAGAGTGTGTCTTA 

ATCTATTAATGTAACAGAACAACTTCAGTCCTAGTGTTTTGTGGGCTGGAATTGCCCATGTG 

GTAGGGACAGGCCTGCTAAATCACTGCAATCGCCTATGTTCTGAAGGTATTTGGGAAAGAAA 

GGGATTTGGGGGATTGCCTGTGATTGGCTTTAATTGAATGGCAAATCACAGGAAAGCAGTTC 

TGCTCAACAGTTGGTTGTTTCAGCCAATTCTTGCAGCCAAAGAGCCGGGTGCCCAGCGATAT 

AATAGTTGTCACTTGTGTCTGTATGGATGACAGGGAGGTAGGGTGACCTGAGGACCACCCTC 

CAGCTTCTGCTAGCGTAGGTACAGTCACCACCTCCAGCTCCACACGAGTCCCATCGTGGTTT 

ACCAAAGAAACACAATTATTTGGACCAGTTTGGAAAGTCACCCGCTGAATTGTGAGGCTAGA 

TTAATAGAGCTGAAGAGCAAATGTTCCCAACTTGGAGATACTAGTTGGTATTAGTATCAGAG 

GAACAGGGCCATAGCACCTCCATGCTATTAGATTCCGGCTGGCATGTACTTTTCAAGATGAT 

TTGTAACTAACAATGGCTTATTGTGCTTGTCTTAAGTCTGTGTCCTAATGTAAATGTTCCTT 

TGGTTTATATAACCTTCTTGCCATTTGCTCTTCAGGTGTTCTTGCAGAACACTGGCTGCTTT 

AATCTAGTTTAACTGTTGCTTGATTATTCTTAGGGATAAGATCTGAATAAACTTTTTGTGGC 

TTTGGCAGACTTTAGCTTGGGCTTAGCTCCCACATTAGCTTTTGCTGCCTTTTCTGTGAAGC 

TATCAAGATCCTACTCAATGACATTAGCTGGGTGCAGGTGTACCAAATCCTGCTCTGTGGAA 

CACATTGTCTGATGATACCGAAGGCAAACGTGAACTCAAAGAGGCACAGAGTTAAGAAGAAG 

TCTGTGCAATTCAGAGGAAAAGCCAAAGTGGCCATTAGACACACTTTCCATGCAGCATTTGC 

CAGTAGGTTTCATATA2\AACTACAAAATGGAATAAACCACTACAAATGGGAAAAGCCTGATA 

CTAGAATTTAAATATTCACCCAGGCTCAAGGGGTGTTTCATGGAGTAATATCACTCTATAAA 

AGTAGGGCAGCCAATTATTCACAGACAAAGCTTTTTTTTTTCTGTGCTGCAGTGCTGTTTTT 

CGGCTGATCCAGGGTTACTTATTGTGGGTCTGAGAGCTGAATGATTTCTCCTTGTGTCATGT 

TGGTGAAGGAGATATGGCCAGGGGGAGATGAGCATGTTCAAGAGGAAACGTTGCATTTTGGT 

GGCTTGGGAGAAAGGTAGAACGATATCAGGTCCATAGTGTCACTAAGAGATCTGAAGGATGG 

TTTTACAGAACAGTTGACTTGGCTGGGTGCAGGCTTGGCTGTAAATGGATGGAAGGATGGAC 

AGATGGGTGGACAGAGATTTCTGTGCAGGAGATCATCTCCTGAGCTCGGTGCTTGACAGACT 

GCAGATCCATCCCATAACCTTCTCCAGCATGAGAGCGCGGGGAGCTTTGGTACTGTTCAGTC 

TGCTGCTTGTTGCTTCCTGGGTGCACAGTGGTGATTTTCTTACTCACACAGGGCAAAAACCT 

GAGCAGCTTCAAAGTGAACAGGTTGCTCTCATAGGCCATTCAGTTGTCAAGATGAGGTTTTT 
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GGTTTCTTGTTTTGTAAGGTGGGAAGAAGCACTGAAGGATCAGTTGCGAGGGCAGGGGTTTA 

GCACTGTTCAGAGAAGTCTTATTTTAACTCCTCTCATGAACAAAAAGAGATGCAGGTGCAGA 

TTCTGGCAAGCATGCAGTGAAGGAGAAAGCCCTGAATTTCTGATATATGTGCAATGTTGGGC 

ACCTAACATTCCCCGCTGAAGCACAGCAGCTCCAGCTCCATGCAGTACTCACAGCTGGTGCA 

GCCCTCGGCTCCAGGGTCTGAGCAGTGCTGGGACTCACGAGGTTCCATGTCTTTCACACTGA 

TAATGGTCCAATTTCTGGAATGGGTGCCCATCCTTGGAGGTCCCCAAGGCCAGGCTGGCTGC 

GTCTCCGAGCAGCCCGATCTGGTGGTGAGTAGCCAGCCCATGGCAGGAGTTAGAGCCTGATG 

GTCTTTAAGGTCCCTTCCAACCTAAGCCATCCTACGATTCTAGGAATCATGACTTGTGAGTG 

TGTATTGCAGAGGCAATATTTTAAAGTTATAAATGTTTTCTCCCCTTCCTTGTTTGTCAAAG 

TTATCTTGATCGCCTTATCAATGCTTTTGGAGTCTCCAGTCATTTTTCTTACAMCAAAAAGA 

GGAGGAAGAATGAAGAGAATCATTTAATTTCTTGATTGAATAGTAGGATTCAGAAAGCTGTA 

CGTAATGCCGTCTCTTTGTATCGAGCTGTAAGGTTTCTCATCATTTATCAGCGTGGTACATA 

TCAGCACTTTTCCATCTGATGTGGAAAAAAAAATCCTTATCATCTACAGTCTCTGTACCTAA 

ACATCGCTCAGACTCTTTACCAAAAAAGCTATAGGTTTTAAAACTACATCTGCTGATAATTT 

GCCTTGTTTTAGCTCTTCTTCCATATGCTGCGTTTGTGAGAGGTGCGTGGATGGGCCTAAAC 

TCTCAGCTGCTGAGCTTGATGGGTGCTTAAGAATGAAGCACTCACTGCTGAAACTGTTTTCA 

TTTCACAGGAATGTTTTAGTGGCATTGTTTTTATAACTACATATTCCTCAGATAAATGAAAT 

CCAGAAATAATTATGCAAACTCACTGCATCCGTTGCACAGGTCTTTATCTGCTAGCAAAGGA 

AATAATTTGGGGATGGCAAAAACATTCCTTCAGACATCTATATTTAAAGGAATATAATCCTG 

GTACCCACCCACTTCATCCCTCATTATGTTCACACTCAGAGATACTCATTCTCTTGTTGTTA 

TCATTTGATAGCGTTTTCTTTGGTTCTTTGCCACGCTCTGGGCTATGGCTGCACGCTCTGCA 

CTGATCAGCAAGTAGATGCGAGGGAAGCAGCAGTGAGAGGGGCTGCCCTCAGCTGGCACCCA 

GCCGCTCAGCCTAGGAGGGGACCTTGCCTTTCCACCAGCTGAGGTGCAGCCCTACAAGCTTA 

CACGTGCTGCGAGCAGGTGAGCAAAGGGAGTCTTCATGGTGTGTTTCTTGCTGCCCGGAAGC 

AAAACTTTACTTTCATTCATTCCCCTTGAAGAATGAGGAATGTTTGGAAACGGACTGCTTTA 

CGTTCAATTTCTCTCTTCCCTTTAAGGCTCAGCCAGGGGCCATTGCTGAGGACGGCATCGGG 

GCCCCCTGGACCAAATCTGTGGCACAGATGGTTTCACTTACATCAGTGGATGTGGGATCTGC 

GCCTGTAATGTGTCCTTCTGAAGGAAGGAACGTGCCTTCCAAGTGCCAGCCCCACAGCCCCC 

AGCCCCTCCCTGTGCTGCTCCAATTCATCTCCTCTTCCTCCTTCTCCCTTTGCTGTTTGTGC 

TCGGGTAGAAATCATGAAGATTTAGAAGAGAAAACAAAATAACTGGAGTGGAAACCCAGGTG 

ATGCAGTTCATTCAGCTGTCATAGGTTTGTCGTTGCTATAGGTCTGTATCAGAGATGCTARC 

ACCACTTTGCTGTCGGTGCTTAACTCGGGTGAACTCTCCTTCACTCGCATCATTTGCGGGCC 

TTATTTACATCCCCAGCATCCATCACCCTCTGGGAAAATGGGCGCACTGGATCTCTAATGGA 

AGACTTTCCCTCTTTCAGAGCCTGTGGGATGTGCAGTGACAAGAAACGTGGAGGGGCTGAGC 

AGCAGCACTGCCCCCAGGGAGCAGGAGCGGATGCCATCGGTGGCAGCATCCCAAATGATGTC 

AGCGGATGCTGAGCAGGCAGCGGACGAACGGACAGAAGCGATGCGTACACCTTCTGTTGACA 

TGGTATTTGGCAGCGATTTAACACTCGCTTCCTAGTCCTGCTATTCTCCACAGGCTGCATTC 

AAATGAACGAAGGGAAGGGAGGCAAAAAGATGCAAAATCCGAGACAAGCAGCAGAAATATTT 

CTTCGCTACGGAAGCGTGCGCAAACAACCTTCTCCAACAGCACCAGAAGAGCACAGCGTAAC 

CTTTTTCAAGACCAGAAAAGGAAATTCACAAAGCCTCTGTGGATACCAGCGCGTTCAGCTCT 

CCTGATAGCAGATTTCTTGTCAGGTTGCGAATGGGGTATGGTGCCAGGAGGTGCAGGGACCA 

TATGATCATATACAGCACAGCAGTCATTGTGCATGTATTAATATATATTGAGTAGCAGTGTT 

ACTTTGCCAAAGCAATAGTTCAGAGATGAGTCCTGCTGCATACCTCTATCTTAAAACTAACT 

TATAAATAGTAAAACCTTCTCAGTTCAGCCACGTGCTCCTCTCTGTCAGCACCAATGGTGCT 

TCGCCTGCACCCAGCTGCAAGGAATCAGCCCGTGATCTCATTAACACTCAGCTCTGCAGGAT 

AAATTAGATTGTTCCACTCTCTTTTGTTGTTAATTACGACGGAACAATTGTTCAGTGCTGAT 

GGTCCTAATTGTCAGCTACAGAAAACGTCTCCATGCAGTTCCTTCTGCGCCAGCAAACTGTC 

CAGGCTATAGCACCGTGATGCATGCTACCTCTCACTCCATCCTTCTTCTCTTTCCCACCAGG 

GAGAGCTGTGTGTTTTCACTCTCAGCCACTCTGAACAATACCAAACTGCTACGCACTGCCTC 

CCTCGGAAAGAGAATCCCCTTGTTGCTTTTTTATTTACAGGATCCTTCTTAAAAAGCAGACC 

ATCATTCACTGCAAACCCAGAGCTTCATGCCTCTCCTTCCACAACCGAAAACAGCCGGCTTC 

ATTTGTCTTTTTTAAATGCTGTTTTCCAGGTGAATTTTGGCCAGCGTGTTGGCTGAGATCCA 

GGAGCACGTGTCAGCTTTCTGCTCTCATTGCTCCTGTTCTGCATTGCCTCTTTCTGGGGTTT 

CCAAGAGGGGGGGAGACTTTGCGCGGGGATGAGATAATGCCCCTTTTCTTAGGGTGGCTGCT 
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GGGCAGCAGAGTGGCTCTGGGTCACTGTGGCACCAATGGGAGGCACCAGTGGGGGTGTGTTT 
TGTGCAGGGGGGAAGCATTCACAGAATGGGGCTGATCCTGAAGCTTGCAGTCCAAGGCTTTG 
TCTGTGTACCCAGTGAAATCCTTCCTCTGTTACATAAAGCCCAGATAGGACTCAGAAATGTA 
GTCATTCCAGCCCCCCTCTTCCTCAGATCTGGAGCAGCACTTGTTTGCAGCCAGTCCTCCCC 
AAAATGCACAGACCTCGCCGAGTGGAGGGAGATGTAAACAGCGAAGGTTAATTACCTCCTTG 
TCAAAAACACTTTGTGGTCCATAGATGTTTCTGTCAATCTTACAAAACAGAACCGAGAGGCA 
GCGAGCACTGAAGAGCGTGTTCCCATGCTGAGTTAATGAGACTTGGCAGCTCGCTGTGCAGA 
GATGATCCCTGTGCTTCATGGGAGGCTGTAACCTGTCTCCCCATCGCCTTCACACCGCAGTG 
CTGTCCTGGACACCTCACCCTCCATAAGCTGTAGGATGCAGCTGCCCAGGGATCAAGAGACT 
TTTCCTAAGGCTCTTAGGACTCATCTTTGCCGCTCAGTAGCGTGCAGCAATTACTCATCCCA 
ACTATACTGAATGGGTTTCTGCCAGCTCTGCTTGTTTGTCAATAAGCATTTCTTCATTTTGC 
CTCTAAGTTTCTCTCAGCAGCACCGCTCTGGGTGACCTGAGTGGCCACCTGGAACCCGAGGG 
GCACAGCCACCACCTCCCTGTTGCTGCTGCTCCAGGGACTCATGTGCTGCTGGATGGGGGGA 
AGCATGAAGTTCCTCACCCAGACACCTGGGTTGCAATGGCTGCAGCGTGCTCTTCTTGGTAT 
GCAGATTGTTTCCAGCCATTACTTGTAGAAATGTGCTGTGGAAGCCCTTTGTATCTCTTTCT 
GTGGCCCTTCAGCAAAAGCTGTGGGAAAGCTCTGAGGCTGCTTTCTTGGGTCGTGGAGGAAT 
TGTATGTTCCTTCTTTAACAAAAATTATCCTTAGGAGAGAGCACTGTGCAAGCATTGTGCAC 
ATAAAACAATTCAGGTTGAAAGGGCTCTCTGGAGGTTTCCAGCCTGACTACTGCTGGAAGCA 
AGGCCAGGTTCAAAGATGGCTCAGGATGCTGTGTGCCTTCCTGATTATCTGTGCCACCAATG 
GAGGAGATTCACAGCCACTCTGCTTCCCGTGCCACTCATGGAGAGGAATATTCCCTTATATT 
CAGATAGAATGTTATCCTTTAGCTCAGCCTTCCCTATAACCCCATGAGGGAGCTGCAGATCC 
CCATACTCTCCCCTTCTCTGGGGTGAAGGCCGTGTCCCCCAGCCCCCCTTCCCACCCTGTGC 
CCTAAGCAGCCCGCTGGCCTCTGCTGGATGTGTGCCTATATGTCAATGCCTGTCCTTGCAGT 
CCAGCCTGGGACATTTAATTCATCACCAGGGTAATGTGGAACTGTGTCATCTTCCCCTGCAG 
GGTACAAAGTTCTGCACGGGGTCCTTTCGGTTCAGGAAAACCTTCACTGGTGCTACCTGAAT 
CAAGCTCTATTTAATAAGTTCATAAGCACATGGATGTGTTTTCCTAGAGATACGTTTTAATG 
GTATCAGTGATTTTTATTTGCTTTGTTGCTTACTTCAAACAGTGCCTTTGGGCAGGAGGTGA 
GGGACGGGTCTGCCGTTGGCTCTGCAGTGATTTCTCCAGGCGTGTGGCTCAGGTCAGATAGT 
GGTCACTCTGTGGCCAGAAGAAGGACAAAGATGGAAATTGCAGATTGAGTCACGTTAAGCAG 
GCATCTTGGAGTGATTTGAGGCAGTTTCATGAAAGAGCTACGACCACTTATTGTTGTTTTCC 
CCTTTTACAACAGAAGTTTTCATCAAAATAACGTGGCAAAGCCCAGGAATGTTTGGGAAAAG 
TGTAGTTAAATGTTTTGTAATTCATTTGTCGGAGTGCTACCAGCTAAGAAAAAAGTCCTACC 
TTTGGTATGGTAGTCCTGCAGAGAATACAACATCAATATTAGTTTGGAAAAAAACACCACCA 
CCACCAGAAACTGTAATGGAAAATGTAAACCAAGAAATTCCTTGGGTAAGAGAGAAAGGATG 
TCGTATACTGGCCAAGTCCTGCCCAGCTGTCAGCCTGCTGACCCTCTGCAGTTCAGGACCAT 
GAAACGTGGCACTGTAAGACGTGTCCCCTGCCTTTGCTTGCCCACAGATCTCTGCCCTTGTG 
CTGACTCCTGCACACAAGAGCATTTCCCTGTAGCCAAACAGCGATTAGCCATAAGCTGCACC 
TGACTTTGAGGATTAAGAGTTTGCAATTAAGTGGATTGCAGCAGGAGATCAGTGGCAGGGTT 
GCAGATGAAATCCTTTTCTAGGGGTAGCTAAGGGCTGAGCAACCTGTCCTACAGCACAAGCC 
AAACCAGCCAAGGGTTTTCCTGTGCTGTTCACAGAGGCAGGGCCAGCTGGAGCTGGAGGAGG 
TTGTGCTGGGACCCTTCTCCCTGTGCTGAGAATGGAGTGATTTCTGGGTGCTGTTCCTGTGG 
CTTGCACTGAGCAGCTCAAGGGAGATCGGTGCTCCTCATGCAGTGCCAAAACTCGTGTTTGA 
TGCAGAAAGATGGATGTGCACCTCCCTCCTGCTAATGCAGCCGTGAGCTTATGAAGGCAATG 
AGCCCTCAGTGCAGCAGGAGCTGTAGTGCACTCCTGTAGGTGCTAGGGAAAATCTCTGGTTC 
CCAGGGATGCATTCATAAGGGCAATATATCTTGAGGCTGCGCCAAATCTTTCTGAAATATTC 
ATGCGTGTTCCCTTAATTTATAGAAACAAACACAGCAGAATAATTATTCCAATGCCTCCCCT 
CGAAGGAAACCCATATTTCCATGTAGAAATGTAACCTATATACACACAGCCATGCTGCATCC 
TTCAGAACGTGCCAGTGCTCATCTCCCATGGCAAAATACTACAGGTATTCTCACTATGTTGG 
ACCTGTGAAAGGAACCATGGTAAGAAACTTCGGTTAAAGGTATGGCTGCAAAACTACTCATA 
CCAAAACAGCAGAGCTCCAGACCTCCTCTTAGGAAAGAGCCACTTGGAGAGGGATGGTGTGA 
AGGCTGGAGGTGAGAGACAGAGCCTGTCCCAGTTTTCCTGTCTCTATTTTCTGAAACGTTTG 
CAGGAGGAAAGGACAACTGTACTTTCAGGCATAGCTGGTGCCCTCACGTAAATAAGTTCCCC 
GAACTTCTGTGTCATTTGTTCTTAAGATGCTTTGGCAGAACACTTTGAGTCAATTCGCTTAA 
CTGTGACTAGGTCTGTAAATAAGTGCTCCCTGCTGATAAGGTTCAAGTGACATTTTTAGTGG 
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TATTTGACAGCATTTACCTTGCTTTCAAGTCTTCTACCAAGCTCTTCTATACTTAAGCAGTG 
AAACCGCCAAGAAACCCTTCCTTTTATCAAGCTAGTGCTAAATACCATTAACTTCATAGGTT 
AGATACGGTGCTGCCAGCTTCACCTGGCAGTGGTTGGTCAGTTCTGCTGGTGACAAAGCCTC 
CCTGGCCTGTGCTTTTACCTAGAGGTGAATATCCAAGAATGCAGAACTGCATGGAAAGCAGA 
GCTGCAGGCACGATGGTGCTGAGCCTTAGCTGCTTCCTGCTGGGAGATGTGGATGCAGAGAC 
GAATGAAGGACCTGTCCCTTACTCCCCTCAGCATTCTGTGCTATTTAGGGTTCTACCAGAGT 
CCTTAAGAGGTTTTTTTTTTTTTTGGTCCAAAAGTCTGTTTGTTTGGTTTTGACCACTGAGA 
GCATGTGACACTTGTCTCAAGCTATTAACCAAGTGTCCAGCCAAAATCAATTGCCTGGGAGA 
CGCAGACCATTACCTGGAGGTCAGGACCTCAATAAATATTACCAGCCTCATTGTGCCGCTGA 
CAGATTCAGCTGGCTGCTCCGTGTTCCAGTCCAACAGTTCGGACGCCACGTTTGTATATATT 
TGCAGGCAGCCTCGGGGGGACCATCTCAGGAGCAGAGCACCGGCAGCCGCCTGCAGAGCCGG 
GCAGTACCTCACCATGGCTTTGACCTTTGCCTTACTGGTGGCTCTCCTGGTGCTGAGCTGCA 
AGAGCAGCTGCTCTGTGGGCTGCGATCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACC 
CTGATGCTGCTGGCTCAGATGAGGAGAATCAGCCTGTTTAGCTGCCTGAAGGATAGGCACGA 
TTTTGGCTTTCCTCAAGAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGACCATCCCTGTGC 
TGCACGAGATGATCCAGCAGATCTTTAACCTGTTTAGCACCAAGGATAGCAGCGCTGCTTGG 
GATGAGACCCTGCTGGATAAGTTTTACACCGAGCTGTACCAGCAGCTGAACGATCTGGAGGC 
TTGCGTGATCCAGGGCGTGGGCGTGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGG 
CTGTGAGGAAGTACTTTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTACAGCCCCTGC 
GCTTGGGAAGTCGTGAGGGCTGAGATCATGAGGAGCTTTAGCCTGAGCACCAACCTGCAAGA 
GAGCTTGAGGTCTAAGGAGTAAAAAGTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACA 
TGATAAGATACATTGATGAGTTTGGACAAACCAGAACTAGAATGCAGTGAAAAAAATGCTTT 
ATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGT 
TAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTT 
AAAGCAAGTAAAACCTCTACAAATGTGGTAAAATCGATACCGTCGACCTCGACTAGAGCGGC 

cactaacatacgctctccatcaaaacaaaacgaaacaaaacaaactagcaaaataggctgtc 

cccagtgcaagtgcaggtgccagaacatttctctatcgataggtaccgagctcttacgcgtg 
ctagcccrcgagcaggatctatacattgaatcaatattggcaattagccatattagtcattg 
gttatatagcataaatcaatattggctattggccattgcatacgttgtatctatatcataat 
atgtacatttatattggctcatgtccaatatgaccgccatgttgacattgattattgactag 
ttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgtta 
cataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtca 
ataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtgga 
gtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtccgcccc 
ctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttacgg 
gactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggtt 
ttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccacc 
ccattgacgtcaatgggagtttgttttggcaccaaaatc2\acgggactttccaaaatgtcgt 
aacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataag 
cagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttttgacctcc 
atagaagacaccgggaccgatccagcctcccctcgaagctcgactctaggggctcgagatct 

GCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGTGCCGCCACCATCCC 

ctgacccacgcccctgacccctcacaaggagacgaccttccatgaccgagtacaagcccacg 
gtgcgcctcgccacccgcgacgacgtcccccgggccgtacgcaccctcgccgccgcgttcgc 
cgactaccccgccacgcgccacaccgtcgacccggaccgccacatcgagcgggtcaccgagc 
tgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgac 
ggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccga 
gatcggcccgcgcatggccgagtt.gagcggttcccggctggccgcgcagcaacagatggaag 
gcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcg 
cccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccga 
gcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagc 
ggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatg 
acccgcaagcccggtgcctgacgcccgccccacgacccgcagcgcccgaccgaaaggagcgc 
acgaccccatggctccgaccgaagccgacccgggcggccccgccgaccccgcacccgccccc 
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GAGGCCCACCGACTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACAT 
TGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTT 
GTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAAT 
TGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAA 
CCTCTACAAATGTGGTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTA 
GGTCACGGTCTCGAAGCCGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGT 
ACTCCACCTCACCCATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCG 
GCGAACGCGCGGCGCACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGT 
GAGCACGGGACGTGCGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCT 
CGACGGTCACGGCGGGCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCC 
TTCAACCCAGTCAGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGAC 
TGTCTTCTTTATCATGCAACTCGTAGGACAGGTGCCGGCAGC 



Fig. 16 
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pRSV-C31int (SEQ ID NO: 9) 

CTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCC 

GCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCT 

CACTCAZVAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATG 

TGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTC 

CATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGA 

AACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCT 

CCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTG 

GCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAG 

CTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTAT 

CGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAAC 

AGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 

TACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTC 

GGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTT 

TTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATC 

TTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATG 

AGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCA 

ATCT7\AAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCA 

CCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG 

ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 

CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGC 

AGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCT 

AGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATC 

GTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGG 

CGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATC 

GTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAAT 

TCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAG 

TCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGAT 

AATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGG 

CGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCA 

CCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGA 

AGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTC 

TTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATA 

TTTG2!^TGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTG 

CCACCTGACGTCGACGGATCGGGAGATCTCCCGATCCCCTATGGTCGACTCTCAGTACAA 

TCTGCTCTGATGCCGCATAGTTAAGCCAGTATCTGCTCCCTGCTTGTGTGTTGGAGGTCG 

CTGAGTAGTGCGCGAGCAAAATTTAAGCTACAACAAGGCAAGGCTTGACCGACAATTGCA 

TGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCTGCTTCGCGATGTACGGGCCAGATATA 

CGCGTGCTAGGGGTCTAGGATCGATTCTAGGAATTCTCTAGCCGCGGTCTAGGGATCCCG 

GCGCGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATCT 

GCTCCCTGCTTGTGTGTTGGAGGTCGCTGAGTAGTGCGCGAGCAAAATTTAAGCTACAAC 

AAGGCAAGGCTTGACCGACAATTGCATGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCT 

GCTTCGCGATGTACGGGCCAGATATACGCGTATCTGAGGGGACTAGGGTGTGTTTAGGCG 

AAAAGCGGGGCTTCGGTTGTACGCGGTTAGGAGTCCCCTCAGGATATAGTAGTTTCGCTT 

TTGCATAGGGAGGGGGAAATGTAGTCTTATGCAATACACTTGTAGTCTTGCAACATGGTA 

ACGATGAGTTAGCAACATGCCTTACAAGGAGAGAAAAAGCACCGTGCATGCCGATTGGTG 

GAAGTAAGGTGGTACGATCGTGCCTTATTAGGAAGGCAACAGACAGGTCTGACATGGATT 

GGACGAACCACTGAATTCCGCATTGCAGAGATAATTGTATTTAAGTGCCTAGCTCGATAC 

AATAAACGCCATTTGACCATTCACCACATTGGTGTGCACCTCCAAGCTTGCATGCCTGCA 

GGTACCGGTCCGGAATTCCCGGGTCGACGAGCTCACTAGTCGTAGGGTCGCCGACATGAC 

ACAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCGCGCGA 

GCGCGAGAATTCGAGCGCAGCAAGCCCAGCGACACAGCGTAGCGCCAACGAAGACAAGGC 

GGCCGACCTTCAGCGCGAAGTCGAGCGCGACGGGGGCCGGTTCAGGTTCGTCGGGCATTT 

CAGCGAAGCGCCGGGCACGTCGGCGTTCGGGACGGCGGAGCGCCCGGAGTTCGAACGCAT 
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CCTGAACGAATGCCGCGCCGGGCGGCTCAACATGATCATTGTCTATGACGTGTCGCGCTT 

CTCGCGCCTGAAGGTCATGGACGCGATTCCGATTGTCTCGGAATTGCTCGCCCTGGGCGT 

GACGATTGTTTCCACTCAGGAAGGCGTCTTCCGGCAGGGAAACGTCATGGACCTGATTCA 

CCTGATTATGCGGCTCGACGCGTCGCACAAAGAATCTTCGCTGAAGTCGGCGAAGATTCT 

CGACACGAAGAACCTTCAGCGCGAATTGGGCGGGTACGTCGGCGGGAAGGCGCCTTACGG 

CTTCGAGCTTGTTTCGGAGACGAAGGAGATCACGCGCAACGGCCGAATGGTCAATGTCGT 

CATCAACAAGCTTGCGCACTCGACCACTCCCCTTACCGGACCCTTCGAGTTCGAGCCCGA 

CGTAATCCGGTGGTGGTGGCGTGAGATCAAGACGCACAAACACCTTCCCTTCAAGCCGGG 

CAGTCAAGCCGCCATTCACCCGGGCAGCATCACGGGGCTTTGTAAGCGCATGGACGCTGA 

CGCCGTGCCGACCCGGGGCGAGACGATTGGGAAGAAGACCGCTTCAAGCGCCTGGGACCC 

GGCAACCGTTATGCGAATCCTTCGGGACCCGCGTATTGCGGGCTTCGCCGCTGAGGTGAT 

CTACAAGAAGAAGCCGGACGGCACGCCGACCACGAAGATTGAGGGTTACCGCATTCAGCG 

CGACCCGATCACGCTCCGGCCGGTCGAGCTTGATTGCGGACCGATCATCGAGCCCGCTGA 

GTGGTATGAGCTTCAGGCGTGGTTGGACGGCAGGGGGCGCGGCAAGGGGCTTTCCCGGGG 

GCAAGCCATTCTGTCCGCCATGGACAAGCTGTACTGCGAGTGTGGCGCCGTCATGACTTC 

GAAGCGCGGGGAAGAATCGATCAAGGACTCTTACCGCTGCCGTCGCCGGAAGGTGGTCGA 

CCCGTCCGCACCTGGGCAGCACGAAGGCACGTGCAACGTCAGCATGGCGGCACTCGACAA 

GTTCGTTGCGGAACGCATCTTCAACAAGATCAGGCACGCCGAAGGCGACGAAGAGACGTT 

GGCGCTTCTGTGGGAAGCCGCCCGACGCTTCGGCAAGCTCACTGAGGCGCCTGAGAAGAG 

CGGCGAACGGGCGAACCTTGTTGCGGAGCGCGCCGACGCCCTGAACGCCCTTGAAGAGCT 

GTACGAAGACCGCGCGGCAGGCGCGTACGACGGACCCGTTGGCAGGAAGCACTTCCGGAA 

GCAACAGGCAGCGCTGACGCTCCGGCAGCAAGGGGCGGAAGAGCGGCTTGCCGAACTTGA 

AGCCGCCGAAGCCCCGAAGCTTCCCCTTGACCAATGGTTCCCCGAAGACGCCGACGCTGA 

CCCGACCGGCCCTAAGTCGTGGTGGGGGCGCGCGTCAGTAGACGACAAGCGCGTGTTCGT 

CGGGCTCTTCGTAGACAAGATCGTTGTCACGAAGTCGACTACGGGCAGGGGGCAGGGAAC 

GCCCATCGAGAAGCGCGCTTCGATCACGTGGGCGAAGCCGCCGACCGACGACGACGAAGA 

CGACGCCCAGGACGGCACGGAAGACGTAGCGGCGTAGCGAGACACCCGGATCCCTCGAGG 

GGCCCTATTCTATAGTGTCACCTAAATGCTAGAGCTCGCTGATCAGCCTCGACTGTGCCT 

TCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAAGGT 

GCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAiy^TTGCATCGCATTGTCTGAGTAGG 

TGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGAC 

AATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCTTCTGAGGCGGAAAGAACCAGG 

TGCCCAGTCATAGCCGAATAGCCTCTCCACCCAAGCGGCCGGAGAACCTGCGTGCAATCC 

ACTGGGGGCGCG 



Fig. 17 
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pCR-XL-TOPO-CMV-PUR-attB (SEQ ID NO: 10) 

AGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGC 

ACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGC 

TCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA 

TTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTAT 

TTAGGTGACGCGTTAGAATACTCAAGCTATGCATCAAGCTTGGTACCGAGCTCGGATCCA 

CTAGTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTGGCCGCAATAAAATATCTTTATTT 

TCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAGTACTAACATACGCTCTCCAT 

CAAAACAAAACGAAACAAAACAAACTAGC2\AAATAGGCTGTCCCCAGTGCAAGTGCAGGT 

GCCAGAACATTTCTCTATCGATAGGTACCGAGCTCTTACGCGTGCTAGCCCTCGAGCAGG 

ATCTATACATTGAATCAATATTGGCAATTAGCCATATTAGTCATTGGTTATATAGCATAA 

ATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTAT 

ATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAG 

TAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTT 

ACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATG 

ACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTAT 

TTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCGGCCCCCT 

ATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTACGG 

GACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGG 

TTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTC 

CACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAA 

TGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTC 

TATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGT 

TTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGG 

GGCTCGAGATCTGCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGT 

GCCGCCACCATCCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGAC 

CGAGTACAAGCCCACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCAC 

CCTCGCCGCCGCGTTCGCCGACTACCCCGCCACGCGCCACACCGTCGACCCGGACCGCCA 

CATCGAGCGGGTCACCGAGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGACATCGG 

CAAGGTGTGGGTCGCGGACGACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGT 

CGAAGCGGGGGCGGTGTTCGCCGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCG 

GCTGGCCGCGCAGCAACAGATGGAAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGC 

GTGGTTCCTGGCCACCGTCGGCGTCTCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGC 

CGTCGTGCTCCCCGGAGTGGAGGCGGCCGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGAC 

CTCCGCGCCCCGCAACCTCCCCTTCTACGAGCGGCTCGGCTTCACCGTCACCGCCGACGT 

CGAGGTGCCCGAAGGACCGCGCACCTGGTGCATGACCCGCAAGCCCGGTGCCTGACGCCC 

GCCCCACGACCCGCAGCGCCCGACCGAAAGGAGCGCACGACCCCATGGCTCCGACCGAAG 

CCGACCCGGGCGGCCCCGCCGACCCCGCACCCGCCCCCGAGGCCCACCGACTCTAGAGTC 

GGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGACAAACCA 

CAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTAT 

TT GT AAC CAT T AT AAGCT GCAAT AAAC AAGT T AAC AAC AAC AATT GC AT T C AT TT T AT GT 

TTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTG 

GTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTC 

GAAGCCGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGTACTCCACCTC 

ACCCATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGC 

GCGGCGCACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGTGAGCAC 

GGGACGTGCGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCTCGAC 

GGTCACGGCGGGCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTT 

CAACCCAGTCAGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGAC 

TGTCTTCTTTATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGC 

TCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGG 

CGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGAAGGGCGJU^T 

TCT GCAGAT AT CCAT CACACTGGCGGCCGCTCGAGCAT GC AT CTAGAGGGCCCAATTCGC 

CCTATAGTGAGTCGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAA 

ACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTA 

ATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTATACGTACGGC 

AGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGTACAGA 

GTGATATTATTGACACGCCGGGGCGACGGATGGTGATCCCCCTGGCCAGTGCACGTCTGC 
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TGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGATGAAAGCTGGC 
GCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGCTG 
ATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTCTGGGGAATAT 
AAATGTCAGGCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTCACGTAGAAAG 
CCAGTCCGCAGAAACGGTGCTGACCCCGGATGAATGTCAGCTACTGGGCTATCTGGACAA 
GGGAAAACGCAAGGGCAAAGAGAAAGCAGGTAGCTTGCAGTGGGCTTACATGGCGATAGC 
TAGACTGGGCGGTTTTATGGACAGCAAGCG^UVCCGGAATTGCCAGCTGGGGCGCCCTCTG 
GTAAGGTTGGGAAGCCCTGCAAAGTAAACTGGATGGCTTTCTCGCCGCCAAGGATCTGAT 
GGCGCAGGGGATCAAGCTCTGATCAAGAGACAGGATGAGGATCGTTTCGCATGATTGAAC 
AAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACT 
GGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGC 
GCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAAGACGAGG 
CAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTG 
TCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGT 
CATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGC 
ATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAG 
CACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGG 
GGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGAGCATGCCCGACGGCGAGGATC 
TCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTT 
CTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGG 
CTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTT 
ACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCT 
TCTGAATTATTAACGCTTACAATTTCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCG 
GTATTTCACACCGCATACAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTG 
TTTATTTTTCTAAATACATTCAiy^TATGTATCCGCTCATGAGACAATAACCCTGATAAAT 
GCTTCAATAATAGCACGTGAGGAGGGCCACCATGGCCAAGTTGACCAGTGCCGTTCCGGT 
GCTCACCGCGCGCGACGTCGCCGGAGCGGTCGAGTTCTGGACCGACCGGCTCGGGTTCTC 
CCGGGACTTCGTGGAGGACGACTTCGCCGGTGTGGTCCGGGACGACGTGACCCTGTTCAT 
CAGCGCGGTCCAGGACCAGGTGGTGCCGGACAACACCCTGGCCTGGGTGTGGGTGCGCGG 
CCTGGACGAGCTGTACGCCGAGTGGTCGGAGGTCGTGTCCACGAACTTCCGGGACGCCTC 
CGGGCCGGCCATGACCGAGATCGGCGAGCAGCCGTGGGGGCGGGAGTTCGCCCTGCGCGA 
CCCGGCCGGCAACTGCGTGCACTTCGTGGCCGAGGAGCAGGACTGACACGTGCTAAAACT 
TCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAAT 
CCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATC 
TTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCT 
ACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGG 
CTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCA 
CTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGC 
TGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGA 
TAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAAC 
GACCTACACCG2^CTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGA 
AGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAG 
GGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTG 
ACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAG 
CAACGCGGCCTTTTTACGGTTCCTGGGCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCC 
TGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGC 
TCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAG 
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SEQID NO: 11 

GACTAGTACTGACGGACACACCGAAGCCCCGGCGGCAACCCTCAGCGGATGCCCCGGGGCTT 
CACGTTTTCCCAGGTCAGAAGCGGTTTTCGGGAGTAGTGCCCCAACTGGGGTAACCTTTGAG 
TTCTCTCAGTTGGGGGCGTAGGGTCGCCGACATGACACAAGGGGTTGTGACCGGGGTGGACA 
CGTACGCGGGTGCTTACGACCGTCAGTCGCGCGAGCGCGACTAGTACA 
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025CIP7SEQ List.txt 
SEQUENCE LISTING 



<110> AviGeni cs , Inc 

<120> ARTIFICIAL CHROMOSOMES IN AVIANS 

<130> AVI-025CIP7PCT 

<160> 16 

<170> Patentln version 3.2 

<210> 1 

<211> 6230 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PlasmI d pCMV-31int 

<400> 1 



cattcgccat 


tcaggctgcg 


caactgttgg 


gaagggcgat 


cggtgcgggc 


ctcttcgcta 


60 


ttacgccagc 


caatacgcaa 


accgcctctc 


cccgcgcgtt 


ggccgattca 


ttaatgcagg 


120 


atcgatccag 


acatgataag 


atacattgat 


gaqtttggac 


aaaccacaac 


tagaatgcag 


180 


tgaaaaaaat 


gctttatttg 


tgaaatttgt 


gatgctattg 


ctttatttgt 


aaccattata 


240 


agctgcaata 


aacaagttaa 


caacaacaat 


tgcattcatt 


ttatgtttca 


ggttcagggg 


300 


gaggtgtggg 


aggtttttta 


aagcaagtaa 


aacctctaca 


aatgt ggtat 


ggctgattat 


350 


gatcatgaac 


agactgtgag 


gactgagggg 


cctgaaatga 


gccttgggac 


tgtgaatcta 


4 7.0 


aaatacacaa 


acaattagaa 


tcactagctc 


ctgtgtataa 


tattttcata 


aatcatactc 


480 


agtaagcaaa 


actctcaagc 


agcaagcata 


tgcagctagt 


ttaacacatt 


atacacttaa 


540 


aaattttata 


tttaccttag 


agctttaaat 


ctctgtaggt 


agtttgtcca 


attatgtcac 


600 


accacagaag 


taaggttcct 


tcacaaagat 


cccaagctag 


cttataatac 


gactcactat 


660 


agggagagag 


ctatgacgtc 


gcatgcacgc 


gtaagcttgg 


gcccctcgag 


ggatccgggt 


720 


gtctcgctac 


gccgctacgt 


cttccgtgcc 


gtcctgggcg 


tcgtcttcgt 


cgtcgtcggt 


780 


cggcggcttc 


gcccacgtga 


tcgaagcgcg 


cttctcgatg 


ggcgttccct 


gccccctgcc 


840 


cgtagtcgac 


ttcgtgacaa 


cgatcttgtc 


tacgaagagc 


ccgacgaaca 


cgcgcttgtc 


900 


gtctactgac 


gcgcgccccc 


accacgactt 


agggccggtc 


gggtcagcgt 


cggcgtcttc 


960 


ggggaaccat 


tggtcaaggg 


gaagcttcgg 


ggcttcggcg 


gcttcaagtt 


cggcaagccg 


1020 


ctcttccgcc 


ccttgctgcc 


ggagcgtcag 


cgctgcctgt 


tgcttccgga 


agtgcttcct 


1080 


gccaacgggt 


ccgtcgtacg 


cgcctgccgc 


gcggtcttcg 


tacagctctt 


caagggcgtt 


1140 


cagggcgtcg 


gcgcgctccg 


caacaaggtt 


cgcccgttcg 


ccgctcttct 


caggcgcctc 


1200 


agtgagcttg 


ccgaagcgtc 


gggcggcttc 


ccacagaagc 
Page 


gccaacgtct 
1 


cttcgtcgcc 


1260 
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025CIP7SEQ 


List .tXt 






ttcggcgtgc 


ctgatcttgt 


tgaagatgcg 


ttccgcaacg 


aacttgtcga 


gtgccgccat 


1320 


gctgacgttg 


cacgtgcctt 


cgtgctgccc 


aggtgcggac 


gggtcgacca 


ccttccggcg 


1380 


acggcagcgg 


taagagtcct 


tgatcgattc 


tt ccccgcgc 


ttcgaagtca 


tgacggcgcc 


1440 


acactcgcag 


tacagcttgt 


ccatggcgga 


cagaatggct 


tgcccccggg 


aaagcccctt 


1500 


gccgcgcccc 


ctgccgtcca 


accacgcctg 


aagctcatac 


cactcagcgg 


gctcgatgat 


1560 


cggtccgcaa 


tcaagctcga 


ccggccggag 


cgtgatcggg 


tcgcgctgaa 


tgcggtaacc 


1520 


ctcaatcttc 


gtggtcggcg 


tgccgtccgg 


cttcttcttg 


tagatcacct 


cagcggcgaa 


1580 


gcccgcaata 


cgcgggtccc 


gaaggattcg 


cataacggtt 


gccgggtccc 


aggcgcttga 


1740 


agcggtcttc 


ttcccaatcg 


tctcgccccg 


ggtcggcacg 


gcgtcagcgt 


ccatgcgctt 


1800 


acaaagcccc 


gtgatgctgc 


ccgggtgaat 


ggcggcttga 


ctgcccggct 


tgaagggaag 


1860 


gtgtttgtgc 


gtcttgatct 


cacgccacca 


ccaccggatt 


acgtcgggct 


cgaactcgaa 


1920 


gggtccggta 


aggggagtgg 


tcgagtgcgc 


aagcttgttg 


atgacgacat 


tgaccattcg 


1980 


gccgttgcgc 


gtgatctcct 


tcgtctccga 


aacaagctcg 


aagccgtaag 


gcgccttccc 


2040 


gccgacgtac 


ccgcccaatt 


cgcgctgaag 


gttcttcgtg 


tcgagaatct 


tcgccgactt 


2100 


cagcgaagat 


tctttgtgcg 


acgcgtcgag 


ccgcataatc 


aggtgaatca 


ggtccatgac 


2160 


gtttccctgc 


cggaagacgc 


cttcctgagt 


ggaaacaatc 


gtcacgccca 


gggcgagcaa 


2220 


ttccgagaca 


atcggaatcg 


cgtccatgac 


cttcaggcgc 


gagaagcgcg 


acacgtcata 


2280 


gacaatgatc 


atgttgagcc 


gcccggcgcg 


gcattcgttc 


aggatgcgtt 


cgaactccgg 


2340 


gcgctccgcc 


gtcccgaacg 


ccgacgtgcc 


cggcgcttcg 


ctgaaatgcc 


cgacgaacct 


2400 


gaaccggccc 


ccgtcgcgct 


cgacttcgcg 


ctgaaggtcg 


gccgccttgt 


cttcgttggc 


2460 


gctacgctgt 


gtcgctgggc 


ttgctgcgct 


cgaattctcg 


cgctcgcgcg 


actgacggtc 


2520 


gtaagcaccc 


gcgtacgtgt 


ccaccccggt 


cacaacccct 


tgtgtcatgt 


cggcgaccct 


2580 


acgactagtg 


agctcgtcga 


cccgggaatt 


ccggaccggt 


acctgcaggc 


gtaccttcta 


2640 


tagtgtcacc 


taaatagctt 


tttgcaaaag 


cctaggctag 


agtccggagg 


ctggatcggt 


2700 


cccggtgtct 


tctatggagg 


tcaaaacagc 


gtggatggcg 


tctccaggcg 


atctgacggt 


2760 


tcactaaacg 


agctctgctt 


atatagacct 


cccaccgtac 


acgcctaccg 


cccatttgcg 


2820 










■t- 1- 1- 1- 1- 1- rrrr 




288 0 


aactcccatt 


gacgtcaatg 


gggtggagac 


ttggaaatcc 


ccgtgagtca 


aaccgctatc 


2940 


cacgcccatt 


gatgtactgc 


caaaaccgca 


tcaccatggt 


aatagcgatg 


actaatacgt 


3000 


agatgtactg 


ccaagtagga 


aagtcccata 


aggtcatgta 


ctgggcataa 


tgccaggcgg 


3060 


gccatttacc 


gtcattgacg 


tcaatagggg 


gcgtacttgg 

Page 


catatgatac 
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acttgatgta 


3120 
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ctgccaagtg 


ggcagtttac 


cgtaaatact 


025CIP7SEQ 
ccacccattg 


List.tXt 
acgtcaatgg 


aaagtcccta 


3180 


ttggcgttac 


tatgggaaca 


tacgtcatta 


ttgacgtcaa 


tgggcggggg 


tcgttgggcg 


3240 


gtcagccagg 


cgggccattt 


accgtaagtt 


atgtaacgac 


ctgcacgatg 


ctgtttcctg 


3300 


tgtgaaattg 


ttatccgctc 


acaattccac 


acattatacg 


agccggaagc 


tataaagtgt 


3360 


aaagcctggg 


gtgcctaatg 


agtgaaaggg 


cctcgtatac 


gcctattttt 


ataggttaat 


3420 


gtcatgataa 


taatggtttc 


ttagacgtca 


ggtggcactt 


ttcggggaaa 


tgtgcgcgga 


3480 


acccctattt 


gtttattttt 


ctaaatacat 


tcaaatatgt 


atccgctcat 


gagacaataa 


3540 


ccctgataaa 


tgcttcaata 


atattgaaaa 


acgcgcgaat 


tgcaagctct 


gcattaatga 


3600 


atcggccaac 


gcgcggggag 


aggcggtttg 


cgtattgggc 


gctcttccgc 


ttcctcgctc 


3660 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


ctcaaaggcg 


3720 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


agcaaaaggc 


3780 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


taggctccgc 


3840 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


cccgacagga 


3900 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


tgttccgacc 


3960 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


gctttctcaa 


4020 


tgctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


gggctgtgtg 


4080 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


tcttgagtcc 


4140 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


gattagcaga 


4200 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


cggctacact 


4260 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


tt acctt egg 


aaaaagagtt 


4320 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


tgtttgcaag 


4380 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


ttctacgggg 


4440 


tctgacgctc 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgcc 


ataacttcgt 


4500 


atagcataca 


tt atacgaag 


ttatggcatg 


agattatcaa 


aaaggatctt 


cacctagatc 


4560 


cttttaaatt 


aaaaatgaag 


ttttaaatca 


atctaaagta 


tatatgagta 


aacttggtct 


4620 


gacagttacc 


aatgcttaat 


cagtgaggca 


cctat ctcag 


cgatctgtct 


atttcgttca 


4680 


tccatagttg 


cctgactccc 


cgtcgtgtag 


ataactacga 


tacgggaggg 


cttaccatct 


4740 


ggccccagtg 


ctgcaatgat 


accgcgagac 


ccacgctcac 


cggctccaga 


tttatcagca 


4800 


ataaaccagc 


cagccggaag 


ggccgagcgc 


agaagtggtc 


ctgcaacttt 


atccgcctcc 


4860 


atccagtcta 


ttaattgttg 


ccgggaagct 


agagtaagta 


gttcgccagt 


taatagtttg 


4920 


cgcaacgttg 


ttgccattgc 


tacaggcatc 


gtggtgtcac 


gctcgtcgtt 


tggtatggct 


4980 


tcattcagct 


ccggttccca 


acgatcaagg 


cgagttacat 

Page 
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gttgtgcaaa 


5040 
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List .txt 






aaagcggtta 


gctccttcgg 


tcctccgatc 


gttgtcagaa 


gtaagttggc 


cgcagtgtta 


5100 


tcactcatgg 


ttatggcagc 


actgcataat 


tctcttactg 


tcatgccatc 


cgtaagatgc 


5160 


ttttctgtga 


ctggtgagta 


ctcaaccaag 


tcattctgag 


aatagtgtat 


gcggcgaccg 


5220 


agttgctctt 


gcccggcgtc 


aatacgggat 


aataccgcgc 


cacatagcag 


aactttaaaa 


5280 


gtgctcatca 


ttggaaaacg 


ttcttcgggg 


cgaaaactct 


caaggatctt 


accgctgttg 


5340 


agatccagtt 


cgatgtaacc 


cactcgtgca 


cccaactgat 


cttcagcatc 


ttttactttc 


5400 


accagcgttt 


ctgggtgagc 


aaaaacagga 


aggcaaaatg 


ccgcaaaaaa 


gggaataagg 


5460 


gcgacacgga 


aatgttgaat 


actcatactc 


ttcctttttc 


aatattattg 


aagcatttat 


5520 


cagggttatt 


gtctcatgcc 


aggggtgggc 


acacatattt 


gataccagcg 


atccctacac 


5580 


agcacataat 


tcaatgcgac 


ttccctctat 


cgcacatctt 


agacctttat 


tctccctcca 


b640 


gcacacatcg 


aagctgccga 


gcaagccgtt 


ctcaccagtc 


caagacctgg 


catgagcgga 


5700 


tacatatttg 


aatgtattta 


gaaaaataaa 


caaatagggg 


ttccgcgcac 


atttccccga 


5760 


aaagtgccac 


ctgaaattgt 


aaacgttaat 


attttgttaa 


aattcgcgtt 


aaatttttgt 


5820 


taaatcagct 


cattttttaa 


ccaataggcc 


gaaatcggca 


aaatccctta 


taaatcaaaa 


5880 


gaatagaccg 


agatagggtt 


gagtgttgtt 


ccagtttgga 


acaagagtcc 


actattaaag 


5940 


aacgtggact 


ccaacgtcaa 


agggcgaaaa 


accgtctatc 


agggcgatgg 


cccactacgt 


6000 


gaaccatcac 


cctaatcaag 


ttttttgggg 


tcgaggtgcc 


gtaaagcact 


aaatcggaac 


6060 


cctaaaggga 


gcccccgatt 


tagagcttga 


cggggaaagc 


cggcgaacgt 


ggcgagaaag 


6120 


gaagggaaga 


aagcgaaagg 


agcgggcgct 


agggcgctgg 


caagtgtagc 


ggtcacgctg 


6180 


cgcgtaacca 


ccacacccgc 


cgcgcttaat 


gcgccgctac 


agggcgcgtc 




6230 


<210> 2 
<2U> 5982 
<212> DNA 

<213> Artificial sequence 










<220> 

<223> Plasmid pCMV-l uc-attB 










<400> 2 
ctctatcgat 


aggtaccgag 


ctcttacgcg 


tgctagccct 


cgagcaggat 


ctatacattg 


60 


aatcaatatt 


ggcaattagc 


catattagtc 


attggttata 


tagcataaat 


caatattggc 


120 


tattggccat 


tgcatacgtt 


gtatctatat 


cataatatgt 


acatttatat 


tggctcatgt 


180 


ccaatatgac 


cgccatgttg 


acattgatta 


ttgactagtt 


attaatagta 


atcaattacg 


240 


gggtcattag 


ttcatagccc 


atatatggag 


ttccgcgtta 


cataacttac 


ggtaaatggc 


300 


ccgcctggct 


gaccgcccaa 


cgacccccgc 


ccattgacgt 


caataatgac 


gtatgttccc 


360 
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atagtaacgc 


caatagggac 


tttccattga 


025CIP7SEQ 
cgtcaatggg 


List . tXt 
tggagtattt 


acggtaaact 


420 


gcccacttgg 


cagtacatca 


agtgtatcat 


atgccaagtc 


cgccccctat 


tgacgtcaat 


480 


gacggtaaat 


ggcccgcctg 


gcattatgcc 


cagtacatga 


ccttacggga 


ctttcctact 


540 


tggcagtaca 


tctacgtatt 


agtcatcgct 


attaccatgg 


tgatgcggtt 


ttggcagtac 


600 


atcaatgggc 


gtggatagcg 


gtttgactca 


cggggatttc 


caagtctcca 


ccccattgac 


660 


gtcaatggga 


gtttgttttg 


gcaccaaaat 


caacgggact 


ttccaaaatg 


tcgtaacaac 


720 


tccgccccat 


tgacgcaaat 


ggqcqgtagQ 


cgtgtacggt 


gggaggtcta 


tataagcaga 


780 


gctcgtttag 


tgaaccgtca 


gatcgcctgg 


agacgccatc 


cacgctgttt 


tgacctccat 


840 


agaagacacc 


gggaccgatc 


cagcctcccc 


tcgaagctcg 


actctagggg 


ctcgagatct 


900 


gcgatctaag 


taagcttggc 


attccggtac 


tgttggtaaa 


gccaccatgg 


aagacgccaa 


960 


aaacataaag 


aaaggcccgg 


cgccattcta 


tccgctggaa 


gatggaaccg 


ctggagagca 


1020 


actgcataag 


gctatgaaga 


gatacgccct 


ggttcctgga 


acaattgctt 


ttacagatgc 


1080 


acatatcgag 


gtggacatca 


cttacgctga 


gtacttcgaa 


atgtccgttc 


ggttggcaga 


1140 


agctatgaaa 


cgatatgggc 


tgaatacaaa 


tcacagaatc 


gtcgtatgca 


gtgaaaactc 


1200 


tcttcaattc 


tttatgccgg 


tqttqqqcqc 


gttatttatc 


ggagttgcag 


ttgcgcccgc 


1260 


gaacgacatt 


tataatgaac 


gtgaattgct 


caacagtatg 


ggcatttcgc 


agcctaccgt 


1320 


ggtgttcgtt 


tccaaaaagg 


ggttgcaaaa 


aattttgaac 


gtgcaaaaaa 


agctcccaat 


1380 


catccaaaaa 


attattatca 


tggattctaa 


aacggattac 


cagggatttc 


agtcgatgta 


1440 


c a c gt. t n gt, c 


a ca t. ct cat c. 


tacctcccgg 


ttttaatgaa 


tacgattttg 


tgccagagtc 


1500 


cttcyatagg 


gacaagacad 


ttgcactgat 


cat, gaactcc 


tctggatcta 


ctggtctgcc 


1560 


taaaggtgtc 


gctctgcctc 


atagaactgc 


ctgcgtgaga 


ttctcgcatg 


oca gaga tec 


1620 


tatttttgqc 


aatcaaatca 


ttccqqatac 


tqcgatttta 


aqtqttqttc 


cattccatca 


1680 


cqqttttgqa 

-} _} ^ ^ ^ ^ _} _} ^ 


atgtttacta 


cactcggata 


tttgatatgt 


qgatttcqaq 


tcgtcttaat 


1740 


gtatagattt 


gaagaagagc 


tgtttctgag 


gagccttcag 


gattacaaga 


ttcaaagtgc 


1800 


gctgctggtg 


ccaaccctat 


tctccttctt 


cgccaaaagc 


actctgattg 


acaaatacga 


1860 


tttatctaat 


ttacacgaaa 


ttgcttctgg 


tggcgctccc 


ctctctaagg 


aagtcgggga 


1920 


agcggttgcc 


aagaggttcc 


atctgccagg 


tatcaggcaa 


ggatatgggc 


tcactgagac 


1980 


tacatcagct 


attctgatta 


cacccgaggg 


ggatgataaa 


ccgggcgcgg 


tcggtaaagt 


2040 


tgttccattt 


tttgaagcga 


aggttgtgga 


tctggatacc 


gggaaaacgc 


tgggcgttaa 


2100 


tcaaagaggc 


gaactgtgtg 


tgagaggtcc 


tatgattatg 


tccggttatg 


taaacaatcc 


2160 


ggaagcgacc 


aacgccttga 


ttgacaagga 


tggatggcta 


cattctggag 


acatagctta 


2220 


ctgggacgaa 


gacgaacact 


tcttcatcgt 


tgaccgcctg 
Page 


aagtctctga 
5 


ttaagtacaa 
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025CIP7SEQ 


List . tXt 






aggctatcag 


gtggctcccg 


ctgaattgga 


atccatcttg 


ctccaacacc 


ccaacatctt 


2340 


cgacgcaggt 


gtcgcaggtc 


ttcccgacga 


tgacgccggt 


gaacttcccg 


ccgccgttgt 


2400 


tgttttggag 


cacggaaaga 


cgatgacgga 


aaaagagatc 


gtggattacg 


tcgccagtca 


2460 


agtaacaacc 


gcgaaaaagt 


tgcgcggagg 


agttgtgttt 


gtggacgaag 


taccgaaagg 


2520 


tcttaccgga 


aaact cgacg 


caagaaaaat 


cagagagatc 


ctcataaagg 


ccaagaaggg 


2580 


cggaaagatc 


gccgtgtaat 


tctagagtcg 


gggcggccgg 


ccgcttcgag 


cagacatgat 


2640 


aagatacatt 


gatgagtttg 


gacaaaccac 


aactagaatg 


cagtgaaaaa 


aatgctttat 


2700 


ttgtgaaatt 


tgtgatgcta 


ttgctttatt 


tgtaaccatt 


ataagctgca 


ataaacaagt 


2760 


taacaacaac 


aattgcattc 


attttatgtt 


tcaggttcag 


ggggaggtgt 


gggaggtttt 


2820 


ttaaagcaag 


taaaacctct 


acaaatgtgg 


taaaatcgat 


aaggatcaat 


tcggcttcag 


2880 


gtaccgtcga 


cgatgtaggt 


cacggtctcg 


aagccgcggt 


gcgggtgcca 


gggcgtgccc 


2940 


ttgggctccc 


cgggcgcgta 


ctccacctca 


cccatctggt 


ccatcatgat 


gaacgggtcg 


3000 


aggtggcggt 


agttgatccc 


ggcgaacgcg 


cggcgcaccg 


ggaagccctc 


gccctcgaaa 


3060 


ccgctgggcg 


cggtggtcac 


ggtgagcacg 


ggacgtgcga 


cggcgtcggc 


gggtgcggat 


3120 


acgcggggca 


gcgtcagcgg 


gttct cgacg 


gtcacggcgg 


gcatgtcgac 


agccgaattg 


3180 


atccgtcgac 


cgatgccctt 


gagagccttc 


aacccagtca 


gctccttccg 


gtgggcgcgg 


3240 


ggcatgacta 


tcgtcgccgc 


acttatgact 


gtcttcttta 


tcatgcaact 


cgtaggacag 


3300 


gtgccggcag 


cgctcttccg 


cttcctcgct 


cactgactcg 


ctgcgctcgg 


tcgttcggct 


3360 


gcggcgagcg 


gtatcagctc 


actcaaaggc 


ggtaatacgg 


ttatccacag 


aatcagggga 


3420 


taacgcagga 


aagaacatgt 


gagcaaaagg 


ccagcaaaag 


gccaggaacc 


gtaaaaaggc 


3480 


cgcgttgctg 


gcgtttttcc 


ataggctccg 


cccccctgac 


gagcatcaca 


aaaatcgacg 


3540 


ctcaagtcag 


aggtggcgaa 


acccgacagg 


actataaaga 


taccaggcgt 


ttccccctgg 


3600 


aagctccctc 


gtgcgctctc 


ctgttccgac 


cctgccgctt 


accggatacc 


tgtccgcctt 


3660 


tctcccttcg 


ggaagcgt gg 


cgctttctca 


atgctcacgc 


tgtaggtatc 


tcagttcggt 


3720 


gtaggtcgtt 


cgctccaagc 


tgggctgtgt 


gcacgaaccc 


cccgttcagc 


ccgaccgctg 


3780 


cgccttatcc 


ggtaactatc 


gtcttgagtc 


caacccggta 


agacacgact 


tatcgccact 


o o yi 
Jo4U 


ggcagcagcc 


actggtaaca 


ggattagcag 


agcgaggtat 


gtaggcggtg 


ctacagagtt 


3900 


cttgaagtgg 


tggcctaact 


acggctacac 


tagaaggaca 


gtatttggta 


tctgcgctct 


3960 


gctgaagcca 


gttaccttcg 


gaaaaagagt 


tggtagctct 


tgatccggca 


aacaaaccac 


4020 


cgctggtagc 


ggtggttttt 


ttgtttgcaa 


gcagcagatt 


acgcgcagaa 


aaaaaggatc 


4080 


tcaagaagat 


cctttgatct 


tttctacggg 


gtctgacgct 


cagtggaacg 


aaaactcacg 


4140 
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ttaagggatt 


ttggtcatga 


gattatcaaa 


025CIP7SEQ 
aaggatcttc 


List .txt 
acctagatcc 


ttttaaatta 


4200 


aaaatgaagt 


tttaaatcaa 


tctaaagtat 


atatgagtaa 


acttggtctg 


acagttacca 


4260 


atgcttaatc 


agtgaggcac 


ctatctcagc 


gatctgtcta 


tttcgttcat 


ccatagttgc 


4320 


ctgactcccc 


gtcgtgtaga 


taactacgat 


acgggagggc 


ttaccatctg 


gccccagtgc 


4380 


tgcaatgata 


ccgcgagacc 


cacgctcacc 


ggctccagat 


ttatcagcaa 


taaaccagcc 


4440 


agccggaagg 


gccgagcgca 


gaagtggtcc 


tgcaacttta 


tccgcctcca 


tccagtctat 


4500 


taattgttgc 


cgggaagcta 


gagtaagtag 


ttcgccagtt 


aatagtttgc 


gcaacgttgt 


4560 


tgccattgct 


acaggcatcg 


tggtgtcacg 


ctcgtcgttt 


ggtatggctt 


cattcagctc 


4620 


cggttcccaa 


cgatcaaggc 


gagttacatg 


atcccccatg 


ttgtgcaaaa 


aagcggttag 


4680 


ctccttcggt 


cctccgatcg 


ttgtcagaag 


taagttggcc 


gcagt gttat 


cactcatggt 


4740 


tatggcagca 


ctgcataatt 


ctcttactgt 


catgccatcc 


gtaagatgct 


tttctgtgac 


4800 


tggtgagtac 


tcaaccaagt 


cattctgaga 


atagtgtatg 


cggcgaccga 


gttgctcttg 


4860 


cccggcgtca 


atacgggata 


ataccgcgcc 


acatagcaga 


actttaaaag 


tgctcatcat 


4920 


tggaaaacgt 


tcttcggggc 


gaaaactctc 


aaggatctta 


ccgctgttga 


gatccagttc 


4980 


gatgtaaccc 


actcgtgcac 


ccaactgatc 


ttcagcatct 


tttactttca 


ccagcgtttc 


5040 


tgggtgagca 


aaaacaggaa 


ggcaaaatgc 


cgcaaaaaag 


ggaataaggg 


cgacacggaa 


5100 


atgttgaata 


ctcatactct 


tcctttttca 


atattattga 


agcatttatc 


agggttattg 


5160 


tctcatgagc 


ggatacatat 


ttgaatgtat 


ttagaaaaat 


aaacaaatag 


gggttccgcg 


5220 


cacatttccc 


cgaaaagtgc 


cacctgacgc 


gccctgtagc 


ggcgcattaa 


gcgcggcggg 


5280 


tgtggtggtt 


acgcgcagcg 


tgaccgctac 


acttgccagc 


gccctagcgc 


ccgctccttt 


5340 


cgctttcttc 


ccttcctttc 


tcgccacgtt 


cgccggcttt 


ccccgtcaag 


ctctaaatcg 


5400 


ggggctccct 


ttagggttcc 


gatttagtgc 


tttacggcac 


ctcgacccca 


aaaaacttga 


5460 


ttagggtgat 


ggttcacgta 


gtgggccatc 


gccctgatag 


acggtttttc 


gccctttgac 


5520 


gttggagtcc 


acgttcttta 


atagtggact 


cttgttccaa 


actggaacaa 


cactcaaccc 


5580 


tatctcggtc 


tattcttttg 


atttataagg 


gattttgccg 


atttcggcct 


attggttaaa 


5540 


;3 :3 "H 1^ ^ "H 








cLdddUciuUcLcL 


C U Cdi^ddU 


5 70 0 


ttcccattcg 


ccattcaggc 


tgcgcaactg 


ttgggaaggg 


cgatcggtgc 


gggcctcttc 


5760 


gctattacgc 


cagcccaagc 


taccatgata 


agtaagtaat 


attaaggtac 


gggaggtact 


5820 


tggagcggcc 


gcaataaaat 


atctttattt 


tcattacatc 


tgtgt gttgg 


ttttttgtgt 


5880 


gaatcgatag 


tactaacata 


cgctctccat 


caaaacaaaa 


cgaaacaaaa 


caaactagca 


5940 


aaataggctg 


tccccagtgc 


aagtgcaggt 


gccagaacat 


tt 




5982 
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025CIP7SEQ listtXt 

<210> 3 

<211> 5924 

<212> DNA 

<213> Artifi cial sequence 

<220> 

<223> Plasmid pCMV-1 uc-attP 

<400> 3 



ctctatcgat 


aggtaccgag 


ctcttacgcg 


tgctagccct 


cgagcaggat 


ctatacattg 


60 


aatcaatatt 


ggcaattagc 


catattagtc 


attggttata 


tagcataaat 


caatattggc 


120 


tattggccat 


tgcatacgtt 


gtatctatat 


cataatatgt 


acatttatat 


tggctcatgt 


180 


ccaatatgac 


cgccatgttg 


acattgatta 


ttgactagtt 


attaatagta 


atcaattacg 


240 


gggtcattag 


ttcatagccc 


atatatggag 


ttccgcgtta 


cataacttac 


ggtaaatggc 


300 


ccgcctggct 


gaccgcccaa 


cgacccccgc 


ccattgacgt 


caataatgac 


gtatgttccc 


360 


atagtaacgc 


caatagggac 


tttccattga 


cgtcaatggg 


tggagtattt 


acggtaaact 


420 


gcccacttgg 


cagtacatca 


agtgtatcat 


atgccaagtc 


cgccccctat 


tgacgtcaat 


480 


gacggtaaat 


ggcccgcctg 


gcattatgcc 


cagtacatga 


ccttacggga 


ctttcctact 


540 


tggcagtaca 


tctacgtatt 


agtcatcgct 


attaccatgg 


tgatgcggtt 


ttggcagtac 


600 


atcaatgggc 


gtggatagcg 


gtttgactca 


cggggatttc 


caagtctcca 


ccccattgac 


660 


gtcaatggga 


gtttgttttg 


gcaccaaaat 


caacgggact 


ttccaaaatg 


tcgtaacaac 


720 


tccgccccat 


tgacgcaaat 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


tataagcaga 


780 


gctcgtttag 


tgaaccgtca 


gatcgcctgg 


agacgccatc 


cacgctgttt 


tgacctccat 


840 


agaagacacc 


gggaccgatc 


cagcctcccc 


tcgaagctcg 


actctagggg 


ctcgagatct 


900 


gcgatctaag 


taagcttggc 


attccggtac 


tgttggtaaa 


gccaccatgg 


aagacgccaa 


960 


aaacataaag 


aaaggcccgg 


cgccattcta 


tccgctggaa 


gatggaaccg 


ctggagagca 


1020 


actgcataag 


gctatgaaga 


gatacgccct 


ggttcctgga 


acaattgctt 


ttacagatgc 


1080 


acatatcgag 


gtggacatca 


cttacgctga 


gtacttcgaa 


atgtccgttc 


ggttggcaga 


1140 


agctatgaaa 


cgatatgggc 


tgaatacaaa 


tcacagaatc 


gtcgtatgca 


gtgaaaactc 


1200 


tcttcaattc 


tttatgccgg 


tgttgggcgc 


gttatttatc 


ggagttgcag 


ttgcgcccgc 


1260 


gaacgacatt 


tataatgaac 


gtgaattgct 


caacagtatg 


ggcatttcgc 


agcctaccgt 


1320 


ggtgttcgtt 


tccaaaaagg 


ggttgcaaaa 


aattttgaac 


gtgcaaaaaa 


agctcccaat 


1380 


catccaaaaa 


attattatca 


tggattctaa 


aacggattac 


cagggatttc 


agtcgatgta 


1440 


cacgttcgtc 


acatctcatc 


tacctcccgg 


ttttaatgaa 


tacgattttg 


tgccagagtc 


1500 


cttcgatagg 


gacaagacaa 


ttgcactgat 


catgaactcc 


tctggatcta 


ctggtctgcc 


1560 


taaaggtgtc 


gctctgcctc 


atagaactgc 


ctgcgtgaga 

page 
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tatttttggc 


aatcaaatca 


ttccggatac 


025CIP7SEQ 
tgcgatttta 


List.tXt 

agtgttgttc 


cattccatca 


1680 


cggttttgga 


atgtttacta 


cactcggata 


tttgatatgt 


ggatttcgag 


tcgtcttaat 


1740 


gtatagattt 


gaagaagagc 


tgtttctgag 


gagccttcag 


gattacaaga 


ttcaaagtgc 


1800 


gctgctggtg 


ccaaccctat 


tctccttctt 


cgccaaaagc 


actctgattg 


acaaatacga 


1860 


tttatctaat 


ttacacgaaa 


ttgcttctgg 


tggcgctccc 


ctctctaagg 


aagtcgggga 


1920 


agcggttgcc 


aagaggttcc 


atctgccagg 


tatcaggcaa 


ggatatgggc 


tcactgagac 


1980 


tacatcagct 


attctgatta 


cacccgaggg 


ggatgataaa 


ccgggcgcgg 


tcggtaaagt 


2040 


tgttccattt 


tttgaagcga 


aggttgtgga 


tctggatacc 


gggaaaacgc 


tgggcgttaa 


2100 


tcaaagaggc 


gaactgtgtg 


tgagaggtcc 


tatgattatg 


tccggttatg 


taaacaatcc 


2160 


ggaagcgacc 


aacgccttga 


ttgacaagga 


tggatggcta 


cattctggag 


acatagctta 


2220 


ctgggacgaa 


gacgaacact 


tcttcatcgt 


tgaccgcctg 


aagtctctga 


ttaagtacaa 


2280 


aggctatcag 


gt ggctcccg 


ctgaattgga 


atccatcttg 


ctccaacacc 


ccaacatctt 


2340 


cgacgcaggt 


gtcgcaggtc 


ttcccgacga 


tgacgccggt 


gaacttcccg 


ccgccgttgt 


2400 


tgttttggag 


cacggaaaga 


cgatgacgga 


aaaagagatc 


gtggattacg 


tcgccagtca 


2460 


agtaacaacc 


gcgaaaaagt 


tgcgcggagg 


agttgtgttt 


gtggacgaag 


taccgaaagg 


2520 


tcttaccgga 


aaactcgacg 


caagaaaaat 


cagagagatc 


ctcataaagg 


ccaagaaggg 


2580 


cggaaagatc 


gccgtgtaat 


tctagagt eg 


gggcggccgg 


ccgcttcgag 


cagacatgat 


2640 


adyril-rlCrl I. L 


gatgagtt Ly 


yacaaaccac 


aactagaatg 


cayLgaaaaa 


aaLgc L I. LaL 


2700 


ttgtgaaatt 


tgtgatgcta 


ttgctttatt 


tgtaaccatt 


ataagctgca 


ataaacaagt 


2760 


taacaacaac 


aattgcattc 


attttatgtt 


tcaggttcag 


ggggaggtgt 


gggaggtttt 


2820 


ttaaagcaag 


taaaacctct 


acaaatgtgg 


taaaatcgat 


aaggatcaat 


tcggcttcga 


2880 


ctagtactga 


cggacacacc 


gaagccccgg 


cggcaaccct 


cagcggatgc 


cccggggctt 


2940 


cacgttttcc 


caggtcagaa 


gcggttttcg 


ggagtagtgc 


cccaactggg 


gtaacctttg 


3000 


agttctctca 


gttgggggcg 


tagggtcgcc 


gacatgacac 


aaggggttgt 


gaccggggtg 


3060 


gacacgtacg 


cgggtgctta 


cgaccgtcag 


tcgcgcgagc 


gcgactagta 


caagccgaat 


3120 


tgatccgtcg 


accgatgccc 


ttgagagcct 


tcaacccagt 


cagctccttc 


cggtgggcgc 


3180 


ggggcatgac 


tatcgtcgcc 


gcacttatga 


ctgtcttctt 


tatcatgcaa 


ctcgtaggac 


3240 


aggtgccggc 


agcgctcttc 


cgcttcctcg 


ctcactgact 


cgctgcgctc 


ggtcgttcgg 


3300 


ctgcggcgag 


cggtatcagc 


tcactcaaag 


gcggtaatac 


ggttatccac 


agaatcaggg 


3360 


gataacgcag 


gaaagaacat 


gtgagcaaaa 


ggccagcaaa 


aggccaggaa 


ccgtaaaaag 


3420 


gccgcgttgc 


tggcgttttt 


ccataggctc 


cgcccccctg 


acgagcatca 


caaaaatcga 


3480 


cgctcaagtc 


agaggtggcg 


aaacccgaca 


ggactataaa 
Page 
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gtttccccct 


3540 
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025CIP7SEQ 


List.tXt 






ggaagctccc 


tcgtgcgctc 


tcctgttccg 


accctgccgc 


ttaccggata 


cctgtccgcc 


3600 


tttctccctt 


cgggaagcgt 


ggcgctttct 


caatgctcac 


gctgtaggta 


tctcagttcg 


3660 


gtgtaggtcg 


ttcgctccaa 


gctgggctgt 


gtgcacgaac 


cccccgttca 


gcccgaccgc 


3720 


tgcgccttat 


ccggtaacta 


tcgtcttgag 


tccaacccgg 


taagacacga 


cttatcgcca 


3780 


ctggcagcag 


ccactggtaa 


caggattagc 


agagcgaggt 


atgtaggcgg 


tgctacagag 


3840 


ttcttgaagt 


ggtggcctaa 


ctacggctac 


actagaagga 


cagtatttgg 


tatctgcgct 


3900 


ctgctgaagc 


cagttacctt 


cggaaaaaga 


gttggtagct 


cttgatccgg 


caaacaaacc 


3960 


accgctggta 


gcggtggttt 


ttttgtttgc 


aagcagcaga 


ttacgcgcag 


aaaaaaagga 


4020 


tctcaagaag 


atcctttgat 


cttttctacg 


gggtctgacg 


ctcagtggaa 


cgaaaactca 


4080 


cgttaaggga 


ttttggtcat 


gagattatca 


aaaaggatct 


tcacctagat 


ccttttaaat 


4140 


taaaaatgaa 


gttttaaatc 


aatctaaagt 


atatatgagt 


aaacttggtc 


tgacagttac 


4200 


caatgcttaa 


tcagtgaggc 


acctatctca 


gcgatctgtc 


tatttcgttc 


atccatagtt 


4260 


gcctgactcc 


ccgtcgtgta 


gataactacg 


atacgggagg 


gcttaccatc 


tggccccagt 


4320 


gctgcaatga 


taccgcgaga 


cccacgctca 


ccggctccag 


atttatcagc 


aataaaccag 


4380 


ccagccggaa 


gggccgagcg 


cagaagt ggt 


cctgcaactt 


tatccgcctc 


catccagtct 


4440 


attaattgtt 


gccgggaagc 


tagagtaagt 


agttcgccag 


ttaatagttt 


gcgcaacgtt 


4500 


gttgccattg 


ctacaggcat 


cgtggtgtca 


cgctcgtcgt 


ttggtatggc 


ttcattcagc 


4b60 


tccggttccc 


aacgatcaag 


gcgagttaca 


tgatccccca 


tgttgtgcaa 


aaaagcggtt 


4620 


agctccttcg 


gtcctccgat 


cgttgtcaga 


agtaagttgg 


ccgcagtgtt 


atcactcatg 


4680 


gttatggcag 


cactgcataa 


ttctcttact 


gtcatgccat 


ccgtaagatg 


cttttctgtg 


4740 


actggtgagt 


actcaaccaa 


gtcattctga 


gaatagt gta 


tgcggcgacc 


gagttgctct 


4800 


tgcccggcgt 


caatacggga 


taataccgcg 


ccacatagca 


gaactttaaa 


agtgctcatc 


4860 


attggaaaac 


gttcttcggg 


gcgaaaactc 


tcaaggatct 


taccgctgtt 


gagatccagt 


4920 


tcgatgtaac 


ccactcgtgc 


acccaactga 


tcttcagcat 


cttttacttt 


caccagcgtt 


4980 


tctgggtgag 


caaaaacagg 


aaggcaaaat 


gccgcaaaaa 


agggaataag 


ggcgacacgg 


5040 


aaatgttgaa 


tactcatact 


cttccttttt 


caatattatt 


gaagcattta 


tcagggttat 


5100 




g(^ggcLucL(^cLi_ 


ctuuugcLcLugu 


a "1" 1" 1" ^3 rt^ 3 3 3 

CLl^ U UCLgcLCLCLCL 


CLUCLCLCLt^CLCLCLU 




5160 


cgcacatttc 


cccgaaaagt 


gccacctgac 


gcgccctgta 


gcggcgcatt 


aagcgcggcg 


5220 


ggtgtggtgg 


ttacgcgcag 


cgtgaccgct 


acacttgcca 


gcgccctagc 


gcccgctcct 


5280 


ttcgctttct 


tcccttcctt 


tctcgccacg 


ttcgccggct 


ttccccgtca 


agctctaaat 


5340 


cgggggctcc 


ctttagggtt 


ccgatttagt 


gctttacggc 


acctcgaccc 


caaaaaactt 


5400 
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gattagggtg 


atggttcacg 


tagtgggcca 


tcgccctgat 


agacggtttt 


tcgccctttg 


5460 


acgttggagt 


ccacgttctt 


taatagtgga 


ctcttgttcc 


aaactggaac 


aacactcaac 


5520 


cctatctcgg 


tctattcttt 


tgatttataa 


gggattttgc 


cgatttcggc 


ctattggtta 


5580 


aaaaatgagc 


tgatttaaca 


aaaatttaac 


gcgaatttta 


acaaaatatt 


aacgtttaca 


56 40 


atttcccatt 


cgccattcag 


gctgcgcaac 


tgttgggaag 


ggcgatcggt 


gcgggcctct 


5700 


tcgctattac 


gccagcccaa 


gctaccatga 


taagtaagta 


atattaaggt 


acgggaggta 


5760 


cttggagcgg 


ccgcaataaa 


atatctttat 


tttcattaca 


tctgtgtgtt 


ggttttttgt 


5820 


gtgaatcgat 


agtactaaca 


tacgctctcc 


atcaaaacaa 


aacgaaacaa 


aacaaactag 


5880 


caaaataggc 


tgtccccagt 


gcaagtgcag 


gtgccagaac 


attt 




5924 


<210> 4 














<211> 5101 














<212> DNA 














<213> Artifi 


cial sequence 










<220> 














<223> Plasmid pCMV-pur-attB 










<400> 4 














ctagagtcgg 


ggcggccggc 


cgcttcgagc 


agacatgata 


agatacattg 


atgagtttgg 


60 


acaaaccaca 


actagaatgc 


agtgaaaaaa 


atgctttatt 


tgtgaaattt 


gtgatgctat 


120 


tgctttattt 


gtaaccatta 


taagctgcaa 


taaacaagtt 


aacaacaaca 


attgcattca 


180 


ttttatgttt 


caggttcagg 


gggaggtgtg 


ggaggttttt 


taaagcaagt 


aaaacctcta 


240 


caaatgtggt 


aaaatcgata 


aggatcaatt 


cggcttcagg 


taccgtcgac 


gatgtaggtc 


300 


acggtctcga 


agccgcggtg 


cgggtgccag 


ggcgtgccct 


tgggctcccc 


gggcgcgtac 


360 


tccacctcac 


ccatctggtc 


catcatgatg 


aacgggtcga 


ggtggcggta 


gttgatcccg 


420 


gcgaacgcgc 


ggcgcaccgg 


gaagccctcg 


ccctcgaaac 


cgctgggcgc 


ggtggtcacg 


480 


gtgagcacgg 


gacgtgcgac 


ggcgtcggcg 


ggtgcggata 


cgcggggcag 


cgtcagcggg 


540 


ttctcgacgg 


tcacggcggg 


catgtcgaca 


gccgaattga 


tccgtcgacc 


gatgcccttg 


600 


agagccttca 


acccagtcag 


ctccttccgg 


tgggcgcggg 


gcatgactat 


cgtcgccgca 


660 


cttatgactg 


tcttctttat 


catgcaactc 


gtaggacagg 


tgccggcagc 


gctcttccgc 


720 


ttcctcgctc 


actgactcgc 


tgcgctcggt 




1^ <T (T 1^ (T (T 1^ (T 




780 


ctcaaaggcg 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


840 


agcaaaaggc 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


900 


taggctccgc 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


960 


cccgacagga 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


1020 


tgttccgacc 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


1080 
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025CIP7SEQ 


List .txt 






gctttctcaa 


tgctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


1140 


gggctgtgtg 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


1200 


tcttgagtcc 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


1260 


gattagcaga 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


1320 


cggctacact 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


1380 


aaaaagagtt 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


1440 


tgtttgcaag 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


1500 


ttctacgggg 


tctgacgctc 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgag 


1560 


attatcaaaa 


aggatcttca 


cctagatcct 


tttaaa 1. 1, aa 


aaatgaagtt 


ttaaatcaat 


1520 


ctaaagtata 


tatgagtaaa 


cttggtctga 


cagttaccaa 


tgcttaatca 


gtgaggcacc 


1680 


tatctcagcg 


atctgtctat 


ttcgttcatc 


catagttgcc 


tgactccccg 


tcgtgtagat 


1740 


aactacgata 


cgggagggct 


taccatctgg 


ccccagtgct 


gcaatgatac 


cgcgagaccc 


1800 


acgctcaccg 


gctccagatt 


tatcagcaat 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


1860 


aagtggtcct 


gcaactttat 


ccgcctccat 


ccagtctatt 


aattgttgcc 


gggaagctag 


1920 


agtaagtagt 


tcgccagtta 


atagtttgcg 


caacgttgtt 


gccattgcta 


caggcatcgt 


1980 


ggtgtcacgc 


tcgtcgtttg 


gtatggcttc 


attcagctcc 


ggttcccaac 


gatcaaggcg 


2040 


agttacatga 


tcccccatgt 


tgtgcaaaaa 


agcggttagc 


tccttcggtc 


ctccgatcgt 


2100 


tgtcagaagt 


aagttggccg 


cagtgttatc 


actcatggtt 


atggcagcac 


tgcataattc 


2160 


tcttactgtc 


atgccatccg 


taagatgctt 


ttctgtgact 


ggtgagtact 


caaccaagtc 


2220 


attctgagaa 


tagtgtatgc 


ggcgaccgag 


ttgctcttgc 


ccggcgtcaa 


tacgggataa 


2280 


taccgcgcca 


catagcagaa 


ctttaaaagt 


gctcatcatt 


ggaaaacgtt 


cttcggggcg 


2340 


aaaactctca 


aggatcttac 


cgctgttgag 


atccagttcg 


atgtaaccca 


ctcgtgcacc 


2400 


caactgatct 


tcagcatctt 


ttactttcac 


cagcgtttct 


gggtgagcaa 


aaacaggaag 


2460 


gcaaaatgcc 


gcaaaaaagg 


gaataagggc 


gacacggaaa 


tgttgaatac 


tcatactctt 


2520 


cctttttcaa 


tattattgaa 


gcatttatca 


gggttattgt 


ctcatgagcg 


gatacatatt 


2580 


tgaatgtatt 


tagaaaaata 


aacaaatagg 


ggttccgcgc 


acatttcccc 


gaaaagtgcc 


2640 


acctgacgcg 


ccctgtagcg 


gcgcattaag 


cgcggcgggt 


gtggtggtta 


cgcgcagcgt 


2700 


gaccgctaca 


cttgccagcg 


ccctagcgcc 


cgctcctttc 


gctttcttcc 


cttcctttct 


2760 


cgccacgttc 


gccggctttc 


cccgtcaagc 


tctaaatcgg 


gggctccctt 


tagggttccg 


2820 


atttagtgct 


ttacggcacc 


tcgaccccaa 


aaaacttgat 


tagggtgatg 


gttcacgtag 


2880 


tgggccatcg 


ccctgataga 


cggtttttcg 


ccctttgacg 


ttggagtcca 


cgttctttaa 


2940 
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tagtggactc 


ttgttccaaa 


ctggaacaac 


025CIP7SEQ 
actcaaccct 


List . txt 
atctcggtct 


attcttttga 


3000 


tttataaggg 


attttgccga 


tttcggccta 


ttggttaaaa 


aatgagctga 


tttaacaaaa 


3060 


atttaacgcg 


aattttaaca 


aaatattaac 


gtttacaatt 


tcccattcgc 


cattcaggct 


3120 


gcgcaactgt 


tgggaagggc 


gatcggtgcg 


ggcctcttcg 


ctattacgcc 


agcccaagct 


3180 


accatgataa 


gtaagtaata 


ttaaggtacg 


ggaggtactt 


ggagcggccg 


caataaaata 


3240 


tctttatttt 


cattacatct 


gtgtgttggt 


tttttgtgtg 


aatcgatagt 


actaacatac 


3300 


gctctccatc 


aaaacaaaac 


gaaacaaaac 


aaactagcaa 


aataggctgt 


ccccagtgca 


3360 


agtgcaggtg 


ccagaacatt 


tctctatcga 


taggtaccga 


gctcttacgc 


gtgctagccc 


3420 


tcgagcagga 


tctatacatt 


gaatcaatat 


tggcaattag 


ccatattagt 


cattggttat 


3480 


atagcataaa 


tcaatattgg 


ctattggcca 


ttgcatacgt 


tgtatctata 


tcataatatg 


3540 


tacatttata 


ttggctcatg 


tccaatatga 


ccgccatgtt 


gacattgatt 


attgactagt 


3600 


tattaatagt 


aatcaattac 


ggggtcatta 


gttcatagcc 


catatatgga 


gttccgcgtt 


3660 


acataactta 


cggtaaatgg 


cccgcctggc 


tgaccgccca 


acgacccccg 


cccattgacg 


3720 


tcaataatga 


cgtatgttcc 


catagtaacg 


ccaataggga 


ctttccattg 


acgtcaatgg 


3780 


gtggagtatt 


tacggtaaac 


tgcccacttg 


gcagtacatc 


aagtgtatca 


tatgccaagt 


3840 


ccgcccccta 


ttgacgtcaa 


tgacggtaaa 


tggcccgcct 


ggcattatgc 


ccagtacatg 


3900 


accttacggg 


actttcctac 


ttggcagtac 


atctacgtat 


tagtcatcgc 


tattaccatg 


3960 


gtgatgcggt 


tttggcagta 


catcaatqqq 


cgtggatagc 


ggtttgactc 


acggggattt 


4020 


ccaagtctcc 


accccattga 


cgtcaatggg 


agtttgtttt 


ggcaccaaaa 


tcaacgggac 


4080 


tttccaaaat 


gtcgtaacaa 


ctccgcccca 


ttgacgcaaa 


tgggcggtag 


gcgtgtacgg 


4140 


tgggaggtct 


atataagcag 


agctcgttta 


gtgaaccgtc 


agatcgcctg 


gagacgccat 


4200 


ccacgctgtt 


ttgacctcca 


tagaagacac 


cgggaccgat 


ccagcctccc 


ctcgaagctc 


4260 


gactctaggg 


gctcgagatc 


tgcgatctaa 


gtaagcttgc 


atgcctgcag 


gtcggccgcc 


4320 


acgaccggtg 


ccgccaccat 


cccctgaccc 


acgcccctga 


cccctcacaa 


ggagacgacc 


4380 


ttccatgacc 


gagtacaagc 


ccacggtgcg 


cctcgccacc 


cgcgacgacg 


tcccccgggc 


4440 


cgtacgcacc 


ctcgccgccg 


cgttcgccga 


ctaccccgcc 


acgcgccaca 


ccgtcgaccc 


4500 


ClClPi CCCiC C PiC 
>^ >^ d v^v^*^ civ^ 


O ^ ^ ^ 


^ ^ 


^ ^ d d y Cl ^ 


1^ 1^ \^ a.' — ^ 




4560 


cgacatcggc 


aaggtgtggg 


tcgcggacga 


cggcgccgcg 


gtggcggtct 


ggaccacgcc 


4520 


ggagagcgtc 


gaagcggggg 


cggtgttcgc 


cgagatcggc 


ccgcgcatgg 


ccgagttgag 


4680 


cggttcccgg 


ctggccgcgc 


agcaacagat 


ggaaggcctc 


ctggcgccgc 


accggcccaa 


4740 


ggagcccgcg 


tggttcctgg 


ccaccgtcgg 


cgtctcgccc 


gaccaccagg 


gcaagggtct 


4800 


gggcagcgcc 


gtcgtgctcc 


ccggagtgga 


ggcggccgag 


cgcgccgggg 


tgcccgcctt 


4860 
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02bCIP7SEQ List.tXt 



cctggagacc 


tccgcgcccc 


gcaacctccc 


cttctacgag 


cggctcggct 


tcaccgtcac 


4920 


cgccgacgtc 


gaggtgcccg 


aaggaccgcg 


cacctggtgc 


atgacccgca 


aqcccqqtqc 


4980 


ctgacgcccg 


ccccacgacc 


cgcagcgccc 


gaccgaaagg 


agcgcacgac 


cccatggctc 


5040 


cgaccgaagc 


cgacccgggc 


ggccccgccg 


accccgcacc 


cgcccccgag 


gcccaccgac 


5100 


t 












5101 


<210> 5 
<211> 5043 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> R asmid pCMV-pur-attP 










<400> 5 
ctagagtcgg 


ggcggccggc 


cgcttcgagc 


agacatgata 


agatacattg 


atgagtttgg 


60 


acaaaccaca 


actagaatgc 


agtgaaaaaa 


atgctttatt 


tgtgaaattt 


gtgatgctat 


120 


-I— -I— -I— -I— ^ -I— -I— 4_ 

tgctttattt 


gtaaccatta 


taagctgcaa 


taaacaagtt 


aacaacaaca 


attgcattca 


180 


4- 4- 4- 4- 4- r~ri- 4- 4- 

ttttatgttt 


caggttcagg 


gggaggtgtg 


ggaggttttt 


taaagcaagt 


aaaacctcta 


240 


caaatgtggt 


aaaatcgata 


aggatcaatt 


cggcttcgac 


tagtactgac 


ggacacaccg 


300 


aagccccggc 


ggcaaccctc 


agcggatgcc 


ccggggcttc 


acgttttccc 


aggtcagaag 


360 


cggttttcgg 


gagtagtgcc 


ccaactgggg 


taacctttga 


gttctctcag 


ttgggggcgt 


420 


agggtcgccg 


acatgacaca 


aggggttgtg 


accggggtgg 


acacgtacgc 


gggtgcttac 


480 


gaccgtcagt 


cgcgcgagcg 


cgactagtac 


aagccgaatt 


gatccgtcga 


ccgatgccct 


540 


tgagagcctt 


caacccagtc 


agctccttcc 


ggtgggcgcg 


gggcatgact 


atcgtcgccg 


600 


cacttatgac 


tgtcttcttt 


atcatgcaac 


tcgtaggaca 


ggtgccggca 


gcgctcttcc 


660 


gcttcctcgc 


tcactgactc 


gctgcgctcg 


gtcgttcggc 


tgcggcgagc 


ggtatcagct 


720 


cactcaaagg 


cggtaatacg 


gttatccaca 


gaatcagggg 


ataacgcagg 


aaagaacatg 


780 


tgagcaaaag 


gccagcaaaa 


ggccaggaac 


cgtaaaaagg 


ccgcgttgct 


ggcgtttttc 


840 


cataggctcc 


gcccccctga 


cgagcatcac 


aaaaa^cgac 


gctcaagtca 


gaggt ggcga 


Q n n 
y u u 


aacccgacag 


gactataaag 


ataccaggcg 


tttccccctg 


gaagctccct 


cgtgcgctct 


960 


cctgttccga 


ccctgccgct 


taccggatac 


ctgtccgcct 


ttctcccttc 


gggaagcgtg 


1020 


gcgctttctc 


aatgctcacg 


ctgtaggtat 


ctcagttcgg 


tgtaggtcgt 


tcgctccaag 


1080 


ctgggctgtg 


tgcacgaacc 


ccccgttcag 


cccgaccgct 


gcgccttatc 


cggtaactat 


1140 


cgtcttgagt 


ccaacccggt 


aagacacgac 


ttatcgccac 


tggcagcagc 


cactggtaac 


1200 


aggattagca 


gagcgaggta 


tgtaggcggt 


gctacagagt 


tcttgaagtg 


gtggcctaac 


1260 
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tacggctaca 


ctagaaggac 


agtatttggt 


025CIP7SEQ 
atctgcgctc 


List.tXt 
tgctgaagcc 


agttaccttc 


1320 


ggaaaaagag 


ttggtagctc 


ttgatccggc 


aaacaaacca 


ccgctggtag 


cggtggtttt 


1380 


tttgtttgca 


agcagcagat 


tacgcgcaga 


aaaaaaggat 


ctcaagaaga 


tcctttgatc 


1440 


ttttctacgg 


ggtctgacgc 


tcagtggaac 


gaaaactcac 


gttaagggat 


tttggtcatg 


1500 


agattatcaa 


aaaggatctt 


cacctagatc 


cttttaaatt 


aaaaatgaag 


ttttaaatca 


1560 


atctaaagta 


tatatgagta 


aacttggtct 


gacagttacc 


aatgcttaat 


cagtgaggca 


1620 


cctatctcag 


cgatctgtct 


atttcgttca 


tccatagttg 


cctgactccc 


cgtcgtgtag 


1680 


ataactacga 


tacgggaggg 


cttaccatct 


ggccccagtg 


ctgcaatgat 


accgcgagac 


1740 


ccacgctcac 


cggctccaga 


tttatcagca 


ataaaccagc 


cagccggaag 


ggccgagcgc 


1800 


agaagtggtc 


ctgcaacttt 


atccgcctcc 


atccagtcta 


ttaattgttg 


ccgggaagct 


1860 


agagtaagta 


gttcgccagt 


taatagtttg 


cgcaacgttg 


ttgccattgc 


tacaggcatc 


1920 


gtggtgtcac 


gctcgtcgtt 


tggtatggct 


tcattcagct 


ccggttccca 


acgatcaagg 


1980 


cgagttacat 


gatcccccat 


gttgtgcaaa 


aaagcggtta 


gctccttcgg 


tcctccgatc 


2040 


gttgtcagaa 


gtaagttggc 


cgcagtgtta 


tcactcatgg 


ttatggcagc 


actgcataat 


2100 


tctcttactg 


tcatgccatc 


cgtaagatgc 


ttttctgtga 


ctggtgagta 


ctcaaccaag 


2160 


tcattctgag 


aatagt gtat 


gcggcgaccg 


agttgctctt 


gcccggcgtc 


aatacgggat 


2220 


aataccgcgc 


cacat agcag 


aactttaaaa 


gtgctcatca 


ttggaaaacg 


ttcttcgggg 


2280 


cgaaaacLcL 


caaggaLcL L 


accgcLgLLg 


agaLccagLL 


cgatgtaacc 


cacLcg Lgca 


2340 


cccaactgat 


cttcagcatc 


ttttactttc 


accagcgttt 


ctgggtgagc 


aaaaacagga 


2400 


aggcaaaatg 


ccgcaaaaaa 


gggaataagg 


gcgacacgga 


aatgttgaat 


actcatactc 


2460 


ttcctttttc 


aatattattg 


aagcatttat 


cagggttatt 


gtctcatgag 


cggatacata 


2520 


tttgaatgta 


tttagaaaaa 


taaacaaata 


ggggttccgc 


gcacatttcc 


ccgaaaagtg 


2b80 


ccacctgacg 


cgccctgtag 


cggcgcatta 


agcgcggcgg 


gtgtggtggt 


tacgcgcagc 


2540 


gtgaccgcta 


cacttgccag 


cgccctagcg 


cccgctcctt 


tcgctttctt 


cccttccttt 


2700 


ctcgccacgt 


tcgccggctt 


tccccgtcaa 


gctctaaatc 


gggggctccc 


tttagggttc 


2760 


cgatttagtg 


ctttacggca 


cctcgacccc 


aaaaaacttg 


attagggtga 


tggttcacgt 


2820 


agtgggccat 


cgccctgata 


gacggttttt 


cgccctttga 


cgttggagtc 


cacgttcttt 


O Q Q n 

Z o o U 


aatagtggac 


tcttgttcca 


aactggaaca 


acactcaacc 


ctatctcggt 


ctattctttt 


2940 


gatttataag 


ggattttgcc 


gatttcggcc 


tattggttaa 


aaaatgagct 


gatttaacaa 


3000 


aaatttaacg 


cgaattttaa 


caaaatatta 


acgtttacaa 


tttcccattc 


gccattcagg 


3060 


ctgcgcaact 


gttgggaagg 


gcgatcggtg 


cgggcctctt 


cgctattacg 


ccagcccaag 


3120 


ctaccatgat 


aagtaagtaa 


tattaaggta 


cgggaggtac 


ttggagcggc 


cgcaataaaa 


3180 
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025CIP7SEQ 


List .txt 






tatctttatt 


ttcattacat 


ctgtgtgttg 


gttttttgtg 


tgaatcgata 


gtactaacat 


3240 


acgctctcca 


tcaaaacaaa 


acgaaacaaa 


acaaactagc 


aaaataggct 


gtccccagtg 


3300 


caagtgcagg 


tgccagaaca 


tttctctatc 


gataggtacc 


gagctcttac 


gcgtgctagc 


3360 


cctcgagcag 


gatctataca 


ttgaatcaat 


attggcaatt 


agccatatta 


gtcattggtt 


3420 


atatagcata 


aatcaatatt 


ggctattggc 


cattgcatac 


gttgtatcta 


tatcataata 


3480 


tgtacattta 


tattggctca 


tgtccaatat 


gaccgccatg 


ttgacattga 


ttattgacta 


3540 


gttattaata 


gtaatcaatt 


acggggtcat 


tagttcatag 


cccatatatg 


gagttccgcg 


3600 


ttacataact 


tacggtaaat 


ggcccgcctg 


gctgaccgcc 


caacgacccc 


cgcccattga 


3550 


cgtcaataat 


gacgtatgtt 


cccatagtaa 


cgccaatagg 


gactttccat 


tgacgtcaat 


3720 


gggtggagta 


tttacggtaa 


actgcccact 


tggcagtaca 


tcaagtgtat 


catatgccaa 


3780 


gtccgccccc 


tattgacgtc 


aatgacggta 


aatggcccgc 


ctggcattat 


gcccagtaca 


3840 


tgaccttacg 


ggactttcct 


acttggcagt 


acatctacgt 


attagtcatc 


gctattacca 


3900 


tggtgatgcg 


gttttggcag 


tacatcaatg 


ggcgtggata 


gcggtttgac 


tcacggggat 


3960 


ttccaagtct 


ccaccccatt 


gacgtcaatg 


ggagtttgtt 


ttggcaccaa 


aatcaacggg 


4020 


actttccaaa 


atgtcgtaac 


aactccgccc 


cattgacgca 


aatgggcggt 


aggcgtgtac 


4080 


ggtgggaggt 


ctatataagc 


agagctcgtt 


tagtgaaccg 


tcagatcgcc 


tggagacgcc 


4140 


atccacgctg 


ttttgacctc 


catagaagac 


accgggaccg 


atccagcctc 


ccctcgaagc 


4200 


tcgactctag 


gggctcgaga 


tctgcgatct 


aagtaagctt 


gcatgcctgc 


aggtcggccg 


4260 


ccacgaccgg 


tgccgccacc 


atcccctgac 


ccacgcccct 


gacccctcac 


aaggagacga 


4320 


ccttccatga 


ccgagtacaa 


gcccacggtg 


cgcctcgcca 


cccgcgacga 


cgtcccccgg 


4380 


gccgtacgca 


ccctcgccgc 


cgcgttcgcc 


gactaccccg 


ccacgcgcca 


caccgtcgac 


4440 


ccggaccgcc 


acatcgagcg 


ggtcaccgag 


ctgcaagaac 


tcttcctcac 


gcgcgtcggg 


4500 


ctcgacatcg 


gcaaggtgtg 


ggtcgcggac 


gacggcgccg 


cggtggcggt 


ctggaccacg 


4560 


ccggagagcg 


tcgaagcggg 


ggcggtgttc 


gccgagatcg 


gcccgcgcat 


ggccgagttg 


4620 


agcggttccc 


ggctggccgc 


gcagcaacag 


atggaaggcc 


tcctggcgcc 


gcaccggccc 


4680 


aaggagcccg 


cgtggttcct 


ggccaccgtc 


ggcgtctcgc 


ccgaccacca 


gggcaagggt 


4740 


ctgggcagcg 


ccgtcgtgct 


ccccggagtg 


gaggcggccg 


agcgcgccgg 


ggtgcccgcc 


4800 


ttcctggaga 


cctccgcgcc 


ccgcaacctc 


cccttctacg 


agcggctcgg 


cttcaccgtc 


4860 


accgccgacg 


tcgaggtgcc 


cgaaggaccg 


cgcacctggt 


gcatgacccg 


caagcccggt 


4920 


gcctgacgcc 


cgccccacga 


cccgcagcgc 


ccgaccgaaa 


ggagcgcacg 


accccatggc 


4980 


tccgaccgaa 


gccgacccgg 


gcggccccgc 


cgaccccgca 


cccgcccccg 


aggcccaccg 


5040 
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025CIP7SEQ List.tXt 

act 5043 

<210> 6 

<211> 5041 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Plasmid pCMV-EGFP-attB 

<400> 6 



ctagagtcgg 


ggcggccggc 


cgcttcgagc 


agacatgata 


agatacattg 


atgagtttgg 


60 


acaaaccaca 


actagaatgc 


agtgaaaaaa 


atgctttatt 


tgtgaaattt 


gtgatgctat 


120 


tgctttattt 


gtaaccatta 


taagctgcaa 


taaacaagtt 


aacaacaaca 


attgcattca 


180 


ttttatgttt 


caggttcagg 


gggaggtgtg 


ggaggttttt 


taaagcaagt 


aaaacctcta 


240 


caaatgtggt 


aaaatcgata 


aggatcaatt 


cggcttcagg 


taccgtcgac 


gatgtaggtc 


300 


acggtctcga 


agccgcggtg 


cgggtgccag 


ggcgtgccct 


tgggctcccc 


gggcgcgtac 


360 


tccacctcac 


ccatctggtc 


catcatgatg 


aacgggtcga 


ggtggcggta 


gttgatcccg 


420 


gcgaacgcgc 


ggcgcaccgg 


gaagccctcg 


ccctcgaaac 


cgctgggcgc 


ggtggtcacg 


480 


gtgagcacgg 


gacgtgcgac 


ggcgtcggcg 


ggtgcggata 


cgcggggcag 


cgtcagcggg 


540 


ttctcgacgg 


tcacggcggg 


catgtcgaca 


gccgaattga 


tccgt.cgacc 


gat-gc:t:t:t-t-g 


600 


agagccttca 


acccagtcag 


ctccttccgg 


Lgggcgcggg 


gcatgactat 


cgtcgccgca 


660 


cttatgactg 


tcttctttat 


catgcaactc 


gtaggacagg 


tgccggcagc 


gctcttccgc 


720 


ttcctcgctc 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


780 


ctcaaaggcg 


gtaatacggt 


tatccacaga 


atcaqgggat 


aacgcaggaa 


agaacatgtg 


840 


agcaaaaggc 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


900 


taggctccgc 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


950 


cccgacagga 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


1020 


tgttccgacc 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctccctt egg 


gaagcgtggc 


1080 


gctttctcaa 


tgctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


1140 


gggctgtgtg 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


1200 


tcttgagtcc 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


1260 


gattagcaga 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


1320 


cggctacact 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


1380 


aaaaagagtt 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


1440 


tgtttgcaag 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


1500 


ttctacgggg 


tctgacgctc 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgag 


1560 
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025CIP7SEQ 


List .txt 






attatcaaaa 


aggatcttca 


cctagatcct 


tttaaattaa 


aaatgaagtt 


ttaaatcaat 


1620 


ctaaagtata 


tatgagtaaa 


cttggtctga 


cagttaccaa 


tgcttaatca 


gtgaggcacc 


1680 


tatctcagcg 


atctgtctat 


ttcgttcatc 


catagttgcc 


tgactccccg 


tcgtgtagat 


1740 


aactacgata 


cgggagggct 


taccatctgg 


ccccagtgct 


gcaatgatac 


cgcgagaccc 


1800 


acgctcaccg 


gctccagatt 


tatcagcaat 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


1860 


aagtggtcct 


gcaactttat 


ccgcctccat 


ccagtctatt 


aattgttgcc 


gggaagctag 


1920 


agtaagtagt 


tcgccagtta 


atagtttgcg 


caacgttgtt 


gccattgcta 


caggcatcgt 


1980 


ggtgtcacgc 


tcgtcgtttg 


gtatggcttc 


attcagctcc 


ggttcccaac 


gatcaaggcg 


2040 


agttacatga 


tcccccatgt 


tgtgcaaaaa 


agcggttagc 


tccttcggtc 


ctccgatcgt 


2100 


tgtcagaagt 


aagttggccg 


cagtgttatc 


actcatggtt 


atggcagcac 


tgcataattc 


2160 


tcttactgtc 


atgccatccg 


taagatgctt 


ttctgtgact 


ggtgagtact 


caaccaagtc 


2220 


attctgagaa 


tagtgtatgc 


ggcgaccgag 


ttgctcttgc 


ccggcgtcaa 


tacgggataa 


2280 


taccgcgcca 


catagcagaa 


ctttaaaagt 


gctcatcatt 


ggaaaacgtt 


cttcggggcg 


2340 


aaaactctca 


aggatcttac 


cgctgttgag 


atccagttcg 


atgtaaccca 


ctcgtgcacc 


2400 


caactgatct 


tcagcatctt 


ttactttcac 


cagcgtttct 


gggtgagcaa 


aaacaggaag 


2460 


gcaaaatgcc 


gcaaaaaagg 


gaataagggc 


gacacggaaa 


tgttgaatac 


tcatactctt 


2520 


cctttttcaa 


tattattgaa 


gcatttatca 


gggttattgt 


ctcatgagcg 


gatacatatt 


2580 


tgaatgtatt 


tagaaaaata 


aacaaatagg 


ggttccgcgc 


acatttcccc 


gaaaagtgcc 


2640 


acctgacgcg 


ccctgtagcg 


gcgcattaag 


cgcggcgggt 


gtggt ggtta 


cgcgcagcgt 


2700 


gaccgctaca 


cttgccagcg 


ccctagcgcc 


cgctcctttc 


gctttcttcc 


cttcctttct 


2760 


cgccacgttc 


gccggctttc 


cccgtcaagc 


tctaaatcgg 


gggctccctt 


tagggttccg 


2820 


atttagtgct 


ttacggcacc 


tcgaccccaa 


aaaacttgat 


tagggtgatg 


gttcacgtag 


2880 


tgggccatcg 


ccctgataga 


cggtttttcg 


ccctttgacg 


ttggagtcca 


cgttctttaa 


2940 


tagtggactc 


ttgttccaaa 


ctggaacaac 


actcaaccct 


atctcggtct 


attcttttga 


3000 


tttataaggg 


attttgccga 


tttcggccta 


ttggttaaaa 


aatgagctga 


tttaacaaaa 


3060 


atttaacgcg 


aattttaaca 


aaatattaac 


gtttacaatt 


tcccattcgc 


cattcaggct 


3120 


gcgcaactgt 


tigggaagggc 


gatcggtgcg 


ggcct ctt eg 


ct att acgcc 


agcccaagct 


3180 


accatgataa 


gtaagtaata 


ttaaggtacg 


ggaggtactt 


ggagcggccg 


caataaaata 


3240 


tctttatttt 


cattacatct 


gtgtgttggt 


tttttgtgtg 


aatcgatagt 


actaacatac 


3300 


gctctccatc 


aaaacaaaac 


gaaacaaaac 


aaactagcaa 


aataggctgt 


ccccagtgca 


3360 


agtgcaggtg 


ccagaacatt 


tctctatcga 


taggtaccga 


gctcttacgc 


gtgctagccc 


3420 
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tcgagcagga 


tctatacatt 


gaatcaatat 


02bCIP7SEQ 
tggcaattag 


List.tXt 
ccatattagt 


cattggttat 


3480 


atagcataaa 


tcaatattgg 


ctattggcca 


ttgcatacgt 


tgtatctata 


tcataatatg 


3540 


tacatttata 


ttggctcatg 


tccaatatga 


ccgccatgtt 


gacattgatt 


attgactagt 


3600 


tattaatagt 


aatcaattac 


ggggtcatta 


gttcatagcc 


catatatgga 


gttccgcgtt 


3660 


acataactta 


cggtaaatgg 


cccgcctggc 


tgaccgccca 


acgacccccg 


cccattgacg 


3720 


tcaataatga 


cgtatgttcc 


catagtaacg 


ccaataggga 


ctttccattg 


acgtcaatgg 


3780 


gt ggagtatt 


tacggtaaac 


tgcccacttg 


gcagtacatc 


aagtgtatca 


tatgccaagt 


3840 


ccgcccccta 


ttgacgtcaa 


tgacggtaaa 


tggcccgcct 


ggcattatgc 


ccagtacatg 


3900 


accttacggg 


actttcctac 


ttggcagtac 


atctacgtat 


tagtcatcgc 


tattaccatg 


3960 


gtgatgcggt 


tttggcagta 


catcaatggg 


cgtggatagc 


ggtttgactc 


acggggattt 


4020 


ccaagtctcc 


accccattga 


cgtcaatggg 


agtttgtttt 


ggcaccaaaa 


tcaacgggac 


4080 


tttccaaaat 


gtcgtaacaa 


ctccgcccca 


ttgacgcaaa 


tgggcggtag 


gcgtgtacgg 


4140 


tgggaggtct 


atataagcag 


agctcgttta 


gtgaaccgtc 


agatcgcctg 


gagacgccat 


4200 


ccacgctgtt 


ttgacctcca 


tagaagacac 


cgggaccgat 


ccagcctccc 


ctcgaagctc 


4260 


gactctaggg 


gctcgagatc 


cccgggtacc 


ggtcgccacc 


atggtgagca 


agggcgagga 


4320 


gctgttcacc 


ggggtggtgc 


ccatcctggt 


cgagctggac 


ggcgacgtaa 


acggccacaa 


4380 


gttcagcgtg 


tccggcgagg 


gcgagggcga 


tgccacctac 


ggcaagctga 


ccctgaay L t 


4440 


catctgcacc 


accggcaagc 


tgcccgtgcc 


ctggcccacc 


ctcgtgacca 


ccctgaccta 


4500 


cggcgtgcag 


tgcttcagcc 


gctaccccga 


ccacatgaag 


cagcacgact 


tcttcaagtc 


4560 


cgccatgccc 


gaaggctacg 


tccaggagcg 


caccatcttc 


ttcaaggacg 


acggcaacta 


4620 


caagacccgc 


gccgaggtga 


agttcgaggg 


cgacaccctg 


gtgaaccgca 


tcgagctgaa 


4680 


gggcatcgac 


ttcaaggagg 


acggcaacat 


cctggggcac 


aagctggagt 


acaactacaa 


4740 


cagccacaac 


gtctatatca 


tggccgacaa 


gcagaagaac 


ggcatcaagg 


tgaacttcaa 


4800 


gatccgccac 


aacatcgagg 


acggcagcgt 


gcagctcgcc 


gaccactacc 


agcagaacac 


4860 


ccccatcggc 


gacggccccg 


tgctgctgcc 


cgacaaccac 


tacctgagca 


cccagtccgc 


4920 


cctgagcaaa 


gaccccaacg 


agaagcgcga 


tcacatggtc 


ctgctggagt 


tcgtgaccgc 


4980 


cgccgggatc 


actctcggca 


tggacgagct 


gtacaagtaa 


agcggccgct 


cgagcatgca 


5040 


t 












5041 



<210> 7 

<211> 18116 

<212> DNA 

<213> Artificial Sequence 



<220> 
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025CIP7SEQ List.tXt 
<223> Plasmid pl2 .Olys-LSPiPNMM-CMV-pur-attB 

<400> 7 



gggctgcagg 


aattcgattg 


ccgccttctt 


tgatattcac 


tctgttgtat 


ttcatctctt 


60 


cttgccgatg 


aaaggatata 


acagtctgta 


taacagtctg 


tgaggaaata 


cttggtattt 


120 


cttctgatca 


gtgtttttat 


aagtaatgtt 


gaatattgga 


taaggctgtg 


tgtcctttgt 


180 


cttgggagac 


aaagcccaca 


gcaggtggtg 


gttggggtgg 


tggcagctca 


gtgacaggag 


240 


aggttttttt 


gcctgttttt 


tttttttttt 


ttttttttaa 


gtaaggtgtt 


cttttttctt 


300 


agtaaatttt 


ctactggact 


gtatgttttg 


acaggtcaga 


aacatttctt 


caaaagaaga 


360 


accttttgga 


aactgtacag 


cccttttctt 


tcattccctt 


tttgctttct 


gtgccaatgc 


420 


ctttggttct 


gattgcatta 


tggaaaacgt 


tgatcggaac 


ttgaggtttt 


tatttatagt 


480 


gtggcttgaa 


agcttggata 


gctgttgtta 


cacgagatac 


cttattaagt 


ttaggccagc 


540 


ttgatgcttt 


attttttccc 


tttgaagtag 


tgagcgttct 


ctggtttttt 


tcctttgaaa 


600 


ctggtgaggc 


ttagattttt 


ctaatgggat 


tttttacctg 


atgatctagt 


tgcataccca 


660 


aatgcttgta 


aatgttttcc 


tagttaacat 


gttgataact 


tcggatttac 


atgttgtata 


720 


tacttgtcat 


ctgtgtttct 


agtaaaaata 


tatggcattt 


atagaaatac 


gtaattcctg 


780 


atttcctttt 


tttttatctc 


tatgctctgt 


gtgtacaggt 


caaacagact 


tcactcctat 


840 


ttttatttat 


agaattttat 


atgcagtctg 


tcgttggttc 


ttgtgttgta 


aggatacagc 


900 


cttaaatttc 


ctagagcgat 


gctcagtaag 


gcgggttgtc 


acatgggttc 


aaatgtaaaa 


960 


cgggcacgtt 


tggctgctgc 


cttcccgaga 


tccaggacac 


taaactgctt 


ctgcactgag 


1020 


gtataaatcg 


cttcagatcc 


cagggaagtg 


cagatccacg 


tgcatattct 


taaagaagaa 


1080 


tgaatacttt 


ctaaaatatt 


ttggcatagg 


aagcaagctg 


catggatttg 


tttgggactt 


1140 


aaattatttt 


ggtaacggag 


tgcataggtt 


ttaaacacag 


ttgcagcatg 


ctaacgagtc 


1200 


acagcgttta 


tgcagaagtg 


atgcctggat 


gcctgttgca 


gctgtttacg 


gcactgcctt 


1260 


gcagtgagca 


ttgcagatag 


gggtggggtg 


ctttgtgtcg 


tgttcccaca 


cgctgccaca 


1320 


cagccacctc 


ccggaacaca 


tctcacctgc 


tgggtacttt 


tcaaaccatc 


ttagcagtag 


1380 


tagatgagtt 


actatgaaac 


agagaagttc 


ctcagttgga 


tattctcatg 


ggatgtcttt 


1440 


tttcccatgt 


tgggcaaagt 


atgataaagc 


atctctattt 


gtaaattatg 


cacttgttag 


1500 


ttcctgaatc 


ctttctatag 


caccacttat 


tgcagcaggt 


gtaggctctg 


gtgtggcctg 


1560 


tgtctgtgct 


tcaatctttt 


aaagcttctt 


tggaaataca 


ctgacttgat 


tgaagtctct 


1620 


tgaagatagt 


aaacagtact 


tacctttgat 


cccaatgaaa 


tcgagcattt 


cagttgtaaa 


1680 


agaattccgc 


ctattcatac 


catgtaatgt 


aattttacac 


ccccagtgct 


gacactttgg 


1740 


aatatattca 


agtaatagac 


tttggcctca 


ccctcttgtg 


tactgtattt 


tgtaatagaa 


1800 
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aatattttaa 


actgtgcata 


tgattattac 


025CIP7SEQ 
attatgaaag 


List.txt 
agacattctg 


ctgatcttca 


1860 


aatgtaagaa 


aatgaggagt 


gcgtgtgctt 


ttataaatac 


aagtgattgc 


aaattagtgc 


1920 


aggtgtcctt 


aaaaaaaaaa 


aaaaaaagta 


atataaaaag 


gaccaggtgt 


tttacaagtg 


1980 


aaatacattc 


ctatttggta 


aacagttaca 


tttttatgaa 


gattaccagc 


gctgctgact 


2040 


ttctaaacat 


aaggctgtat 


tgtcttcctg 


taccattgca 


tttcctcatt 


cccaatttgc 


2100 


acaaggatgt 


ctgggtaaac 


tattcaagaa 


atggctttga 


aatacagcat 


gggagcttgt 


2160 


ctgagttgga 


atgcagagtt 


gcactgcaaa 


atgtcaggaa 


atggatgtct 


ctcagaatgc 


2220 


ccaactccaa 


aggattttat 


atgtgtatat 


agtaagcagt 


ttcctgattc 


cagcaggcca 


2280 


aagagtctgc 


tgaatgttgt 


gttgccggag 


acctgtattt 


ctcaacaagg 


taagatggta 


2340 


tcctagcaac 


tgcggatttt 


aatacatttt 


cagcagaagt 


acttagttaa 


tctctacctt 


2400 


tagggatcgt 


ttcatcattt 


ttagatgtta 


tacttgaaat 


actgcataac 


ttttagcttt 


2460 


catgggttcc 


tttttttcag 


cctttaggag 


actgttaagc 


aatttgctgt 


ccaacttttg 


2520 


tgttggtctt 


aaactgcaat 


agtagtttac 


cttgtattga 


agaaataaag 


accattttta 


2580 


tattaaaaaa 


tacttttgtc 


tgtcttcatt 


ttgacttgtc 


tgatatcctt 


gcagtgccca 


2640 


ttatgtcagt 


tctgtcagat 


attcagacat 


caaaacttaa 


cgtgagctca 


gtggagttac 


2700 


agctgcggtt 


ttgatgctgt 


tattatttct 


gaaactagaa 


atgatgttgt 


cttcatctgc 


2760 


tcatcaaaca 


cttcatgcag 


agtgtaaggc 


tagtgagaaa 


tgcatacatt 


tattgatact 


2820 


tttttaaagt 


caacttttta 


tcagattttt 


ttttcatttg 


gaaatatatt 


gttttctaga 


2880 


ctgcatagct 


tctgaatctg 


aaatgcagtc 


tgattggcat 


gaagaagcac 


agcactcttc 


2940 


atcttactta 


aacttcattt 


tggaatgaag 


gaagttaagc 


aagggcacag 


gtccatgaaa 


3000 


tagagacagt 


gcgctcagga 


gaaagtgaac 


ctggatttct 


ttggctagtg 


ttctaaatct 


3050 


gtagtgagga 


aagtaacacc 


cgattccttg 


aaagggctcc 


agctttaatg 


cttccaaatt 


3120 


gaaggtggca 


ggcaacttgg 


ccactggtta 


tttactgcat 


t at gt ct cag 


tttcgcagct 


318 0 


aacctggctt 


ctccactatt 


gagcatggac 


tatagcctgg 


cttcagaggc 


caggtgaagg 


3240 


ttgggatggg 


tggaaggagt 


gctgggctgt 


ggctgggggg 


actgtgggga 


ctccaagctg 


3300 


agcttggggt 


gggcagcaca 


gggaaaagtg 


tgggtaacta 


tttttaagta 


ctgtgttgca 


3360 






gtdgggtgtg 






f~x\~ f~\f~\f~x^ ;3 i"r 
Sd ^SdSdSd ^ I'^d^ 


3 4 ? 0 


taatatatgg 


atgaattcac 


agtggaagca 


ttcaagggta 


gatcatctaa 


cgacaccaga 


3480 


tcatcaagct 


atgattggaa 


gcggtatcag 


aagagcgagg 


aaggtaagca 


gtcttcatat 


3540 


gttttccctc 


cacgtaaagc 


agtctgggaa 


agtagcaccc 


cttgagcaga 


gacaaggaaa 


3600 


taattcagga 


gcatgtgcta 


ggagaacttt 


cttgctgaat 


tctacttgca 


agagctttga 


3660 


tgcctggctt 


ctggtgcctt 


ctgcagcacc 


tgcaaggccc 


agagcctgtg 


gtgagctgga 


3720 
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025CIP7SEQ 


List .txt 






gggaaagatt 


ctgctcaagt 


ccaagcttca 


gcaggtcatt 


gtctttgctt 


cttcccccag 


3780 


cactgtgcag 


cagagtggaa 


ctgatgtcga 


agcctcctgt 


ccactacctg 


ttgctgcagg 


3840 


cagactgctc 


tcagaaaaag 


agagctaact 


ctatgccata 


gtctgaaggt 


aaaatgggtt 


3900 


ttaaaaaaga 


aaacacaaag 


gcaaaaccgg 


ctgccccatg 


agaagaaagc 


agtggtaaac 


3960 


atggtagaaa 


aggtgcagaa 


gcccccaggc 


agtgtgacag 


gcccctcctg 


ccacctagag 


4020 


gcgggaacaa 


gcttccctgc 


ctagggctct 


gcccgcgaag 


tgcgtgtttc 


tttggtgggt 


4080 


tttgtttggc 


gtttggtttt 


gagatttaga 


cacaagggaa 


gcctgaaagg 


aggtgttggg 


4140 


cactattttg 


gtttgtaaag 


cctgtacttc 


aaatatatat 


tttgtgaggg 


agtgtagcga 


4200 


attggccaat 


ttaaaataaa 


gttgcaagag 


attgaaggct 


gagtagttga 


gagggtaaca 


4260 


cgtttaatga 


gatcttctga 


aactactgct 


tctaaacact 


tgtttgagtg 


gtgagacctt 


4320 


ggataggtga 


gtgctcttgt 


tacatgtctg 


atgcacttgc 


ttgtcctttt 


ccatccacat 


4380 


ccatgcattc 


cacatccacg 


catttgtcac 


ttatcccata 


tctgtcatat 


ctgacatacc 


4440 


tgtctcttcg 


tcacttggtc 


agaagaaaca 


gatgtgataa 


tccccagccg 


ccccaagttt . 


4500 


gagaagatgg 


cagttgcttc 


tttccctttt 


tcctgctaag 


taaggatttt 


ctcctggctt 


4560 


tgacacctca 


cgaaatagtc 


ttcctgcctt 


acattctggg 


cattatttca 


aatatctttg 


4620 


gagtgcgctg 


ctctcaagtt 


tgtgtcttcc 


tactcttaga 


gtgaatgctc 


ttagagtgaa 


4680 


agagaaggaa 


gagaagatgt 


tggccgcagt 


tctctgatga 


acacacctct 


gaataatggc 


4740 


caaaggtggg 


tgggtttctc 


tgaggaacgg 


gcagcgtttg 


cctctgaaag 


caaggagctc 


4800 


tgcggagttg 


cagttatttt 


gcaactgatg 


gtggaactgg 


tgcttaaagc 


agattcccta 


4860 


ggttccctgc 


tacttctttt 


ccttcttggc 


agtcagttta 


tttctgacag 


acaaacagcc 


4920 


acccccactg 


caggcttaga 


aagtatgtgg 


ctctgcctgg 


gtgtgttaca 


gctctgccct 


4980 


ggtgaaaggg 


gattaaaacg 


ggcaccattc 


atcccaaaca 


ggatcctcat 


tcatggatca 


5040 


agctgtaagg 


aacttgggct 


ccaacctcaa 


aacattaatt 


ggagtacgaa 


tgtaattaaa 


5100 


actgcattct 


cgcattccta 


agtcatttag 


tctggactct 


gcagcatgta 


ggtcggcagc 


5160 


tcccactttc 


tcaaagacca 


ctgatggagg 


agtagtaaaa 


atggagaccg 


attcagaaca 


5220 


accaacggag 


tgttgccgaa 


gaaactgatg 


gaaataatgc 


atgaattgtg 


tggtggacat 


5280 


tttttttaaa 


tacataaact 


acttcaaatg 


aggtcggaga 


aggtcagtgt 


tttattagca 


5340 


gccataaaac 


caggtgagcg 


agtaccattt 


ttctctacaa 


gaaaaacgat 


tctgagctct 


5400 


gcgtaagtat 


aagttctcca 


tagcggctga 


agctcccccc 


tggctgcctg 


ccatctcagc 


5460 


tggagtgcag 


tgccatttcc 


ttggggtttc 


tctcacagca 


gtaatgggac 


aatacttcac 


5520 


aaaaattctt 


tcttttcctg 


tcatgtggga 


tccctactgt 


gccctcctgg 


ttttacgtta 


5580 








Page 
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ccccctgact 


gttccattca 


gcggtttgga 


025CIP7SEQ 
aagagaaaaa 


List .txt 
gaatttggaa 


ataaaacatg 


5640 


tctacgttat 


cacctcctcc 


agcattttgg 


tttttaatta 


tgtcaataac 


tggcttagat 


5700 


ttggaaatga 


gagggggttg 


ggtgtattac 


cgaggaacaa 


aggaaggctt 


atataaactc 


5760 


aagtctttta 


tttagagaac 


tggcaagctg 


tcaaaaacaa 


aaaggcctta 


ccaccaaatt 


5820 


aagtgaatag 


ccgctatagc 


cagcagggcc 


agcacgaggg 


atggtgcact 


gctggcacta 


5880 


tgccacggcc 


tgcttgtgac 


tctgagagca 


actgctttgg 


aaatgacagc 


acttggtgca 


5940 


atttcctttg 


tttcagaatg 


cgtagagcgt 


gtgcttggcg 


acagtttttc 


tagttaggcc 


6000 


acttcttttt 


tccttctctc 


ctcattctcc 


taagcatgtc 


tccatgctgg 


taatcccagt 


6060 


caagtgaacg 


ttcaaacaat 


gaatccatca 


ctgtaggatt 


ctcgtggtga 


tcaaatcttt 


6120 


gt gtgaggtc 


tataaaatat 


ggaagcttat 


ttatttttcg 


ttcttccata 


tcagtcttct 


6180 


ctatgacaat 


tcacatccac 


cacagcaaat 


taaaggtgaa 


ggaggctggt 


gggatgaaga 


6240 


gggtcttcta 


gctttacgtt 


cttccttgca 


aggccacagg 


aaaatgctga 


gagctgtaga 


6300 


atacagcctg 


gggtaagaag 


ttcagtctcc 


tgctgggaca 


gctaaccgca 


tcttataacc 


6360 


ccttctgaga 


ctcatcttag 


gaccaaatag 


ggtctatctg 


gggtttttgt 


tcctgctgtt 


6420 


cctcctggaa 


ggctatctca 


ctatttcact 


gctcccacgg 


ttacaaacca 


aagatacagc 


6480 


ctgaattttt 


tctaggccac 


attacataaa 


tttgacctgg 


taccaatatt 


gttctctata 


6540 


tagttatttc 


cttccccact 


gtgtttaacc 


ccttaaggca 


ttcagaacaa 


ctagaatcat 


6600 


agaatggttt 


ggattggaag 


gggccttaaa 


catcatccat 


ttccaaccct 


ctgccatggg 


6660 


ctgcttgcca 


cccactggct 


caggctgccc 


agggccccat 


ccagcctggc 


cttgagcacc 


6720 


tccagggatg 


gggcacccac 


agcttctctg 


ggcagcctgt 


gccaacacct 


caccactctc 


6/80 


tgggtaaaga 


attctctttt 


aacatctaat 


ctaaatctct 


tctcttttag 


tttaaagcca 


6840 


ttcctctttt 


tcccgttgct 


atctgtccaa 


gaaatgtgta 


ttggtctccc 


tcctgcttat 


6900 


aagcaggaag 


tactggaagg 


ctgcagtgag 


gtctccccac 


agcctt ct ct 


t ct ccaggct 


5960 


gaacaagccc 


agctccttca 


gcctgtcttc 


gtaggagatc 


atcttagtgg 


ccctcctctg 


7020 


gacccattcc 


aacagttcca 


cggctttctt 


gtggagcccc 


aggtctggat 


gcagtacttc 


7080 


agatggggcc 


ttacaaaggc 


agagcagatg 


gggacaatcg 


cttacccctc 


cctgctggct 


7140 












^ o "t~ "t~ "t~ 


7 9 n n 


gatttgtgtc 


aagcttttca 


tccaccagaa 


cccacgcttc 


ctggttaata 


cttctgccct 


7260 


cacttctgta 


agcttgtttc 


aggagacttc 


cattctttag 


gacagactgt 


gttacaccta 


7320 


cctgccctat 


tcttgcatat 


atacatttca 


gttcatgttt 


cctgtaacag 


gacagaatat 


7380 


gtattcctct 


aacaaaaata 


catgcagaat 


tcctagtgcc 


atctcagtag 


ggttttcatg 


7440 


gcagtattag 


cacatagtca 


atttgctgca 


agtaccttcc 


aagctgcggc 


ctcccataaa 


7500 
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025CIP7SEQ 


List . tXt 






tcctgtattt 


gggatcagtt 


accttttggg 


gtaagctttt 


gtatctgcag 


agaccctggg 


7560 


ggttctgatg 


tgcttcagct 


ctgctctgtt 


ctgactgcac 


cattttctag 


atcacccagt 


7620 


tgttcctgta 


caacttcctt 


gtcctccatc 


ctttcccagc 


ttgtatcttt 


gacaaataca 


7680 


ggcctatttt 


tgtgtttgct 


tcagcagcca 


tttaattctt 


cagtgtcatc 


ttgttctgtt 


7740 


gatgccactg 


gaacaggatt 


ttcagcagtc 


ttgcaaagaa 


catctagctg 


aaaactttct 


7800 


gccattcaat 


attcttacca 


gttcttcttg 


tttgaggtga 


gccataaatt 


actagaactt 


7860 


cgtcactgac 


aagtttatgc 


attttattac 


ttctattatg 


tacttacttt 


gacataacac 


7920 


agacacgcac 


atattttgct 


gggatttcca 


cagtgtctct 


gtgtccttca 


catggtttta 


7980 


ctgtcatact 


tccgttataa 


ccttggcaat 


ctgcccagct 


gcccatcaca 


agaaaagaga 


8040 


ttcctttttt 


attacttctc 


ttcagccaat 


aaacaaaatg 


tgagaagccc 


aaacaagaac 


8100 


ttgtggggca 


ggctgccatc 


aagggagaga 


cagctgaagg 


gttgtgtagc 


tcaatagaat 


8160 


taagaaataa 


taaagctgtg 


tcagacagtt 


ttgcctgatt 


tatacaggca 


cgccccaagc 


8220 


cagagaggct 


gtctgccaag 


gccaccttgc 


agtccttggt 


ttgtaagata 


agtcataggt 


8280 


aacttttctg 


gtgaattgcg 


tggagaatca 


tgatggcagt 


tcttgctgtt 


tactatggta 


8340 


agatgctaaa 


ataggagaca 


gcaaagtaac 


acttgctgct 


gtaggtgctc 


tgctatccag 


8400 


acagcgatgg 


cactcgcaca 


ccaagatgag 


ggatgctccc 


agctgacgga 


tgctggggca 


8460 


gtaacagtgg 


gtcccatgct 


gcctgctcat 


tagcatcacc 


tcagccctca 


ccagcccatc 


8b20 


agaaggatca 


tcccaagctg 


aggaaagttg 


ctcatcttct 


tcacatcatc 


aaacctttgg 


8580 


cctgactgat 


gcctcccgga 


tgcttaaatg 


tggtcactga 


catctttatt 


tttctatgat 


8640 


ttcaagtcag 


aacctccgga 


tcaggaggga 


acacatagtg 


ggaatgtacc 


ctcagctcca 


8700 


aggccagatc 


ttccttcaat 


gatcatgcat 


gctacttagg 


aaggtgtgtg 


tgtgtgaatg 


8760 


tagaattgcc 


tttgttattt 


tttcttcctg 


ctgtcaggaa 


cattttgaat 


accagagaaa 


8820 


aagaaaagtg 


ctcttcttgg 


catgggagga 


gttgtcacac 


ttgcaaaata 


aaggatgcag 


8880 


tcccaaatgt 


tcataatctc 


agggtctgaa 


ggaggatcag 


aaactgtgta 


tacaatttca 


8940 


ggcttctctg 


aatgcagctt 


ttgaaagctg 


ttcctggccg 


aggcagtact 


agtcagaacc 


9000 


ctcggaaaca 


ggaacaaatg 


tcttcaaggt 


gcagcaggag 


gaaacacctt 


gcccatcatg 


9060 


aa acr1~ craat" a 


apnap"t"crppcr 

^ CL \^ \^ *>J \^ \^ *>J 


p 1" cr a a crcr a a 1" 




t" 1" t" cracTP acrcr 

O ^ O *^ *^ \^ *^ 


^ *^ ^ ^ *^ ^ 


9120 


tcccacactg 


aaacaacagt 


tcatttttat 


aggacttcca 


ggaaggatct 


tcttcttaag 


9180 


cttcttaatt 


atggtacatc 


tccagttggc 


agatgactat 


gactactgac 


aggagaatga 


9240 


ggaactagct 


gggaatattt 


ctgtttgacc 


accatggagt 


cacccatttc 


tttactggta 


9300 


tttggaaata 


ataattctga 


attgcaaagc 


aggagttagc 


gaagatcttc 


atttcttcca 


9360 
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tgttggtgac 
atagggataa 
aattgagacc 
gaccccagct 
catacaactg 
gggttagtaa 
ctcaggtgga 
cttaggacct 
ctggccacag 
tcccagtttg 
ccagctatgg 
ccacaagagt 
attgaccacg 
aactgatact 
gggctctaca 
aagctggcct 
agtgtctgca 
atatgcaggc 
ttcaaagtct 
gtagttcagc 
agattagtgg 
gggaccccaa 
gatctggtac 
ggatggagca 
gaagaacttt 
ccttgtgcta 
caatgaggtc 
gcctgtcttt 
cccgctccaa 
ccagatgggg 
ccagccctct 
atcagttcca 



agcacagttc 
taaatctaag 
tctactggat 
gaggtcaagt 
taattttagc 
tgttaatctc 
aatcaaggat 
ttatgtacag 
ttcagctgag 
tacagttctg 
tcagggagtg 
ggcttttgtt 
aggcaacatt 
acaatgcaat 
ggagtagcta 
cctaaagagg 
gttccccatg 
gtactgcttc 
agt agaa agt 
ttat ttatgc 
taaaacctac 
ggatcatgaa 
tagaccaggc 
tccacaacct 
tccctgacat 
tcactgtcta 
tccttgcagc 
ataggagagg 
gagctccaca 
cctcaaaaga 
tctgatggag 
ctattaaaac 



tggctatgaa 
tttgaagaca 
agctatggta 
atgaactcag 
agtgatttag 
agggaaaaaa 
cacagctcag 
ggcctcaaga 
gcaatcctgg 
gcactttttg 
cctgaccgtc 
gaaattgcag 
aatcctcagc 
cagtatcaac 
ctctctaatg 
ctgctaaaca 
caaagctgcc 
ttgccagcac 
aacaagcttt 
ccactagaaa 
tttatacaat 
gatccaacac 
agcccagggc 
ctctgggcag 
ccaatctaag 
ctcttgtaaa 
cttcttctct 
tgctccagcc 
tctttcctgt 
gcagagtaaa 
ccctggatac 
aggaacaata 



025CIP7SEQ 
agtctgctta 

atgaggtttt 

tttacgtgtc 

gtctctcggg 

ggtttatgag 

aaaaaaagcc 

tgcggtccca 

taactgatgt 

aattttctct 

ggtcaggccg 

ccaactcact 

tgtggcccag 

aagtgcaatt 

aagtggtttg 

gagttgcatt 

ttagggtcaa 

caaacatagc 

tgtccttctc 

gaatgtcatt 

catcttgtac 

catagaatca 

ccccgccaca 

tccatccaac 

cctgtgccag 

ccttccctcc 

aagttgattc 

tctgcaggat 

ctctgatcat 

actgggggcc 

gagggacaat 

aactggcttt 

caacaggtgc 
Page 



List.tXt 
caaggaagag 

agctgcattt 

tttttgctta 

ctactggcat 

tacttttgca 

aaccctgaca 

gagaacacag 

tagtcagaag 

ccgctgcaca 

tgatccaagg 

gcactcaaac 

aggggctgca 

tgcagccatt 

gcttggaaga 

ttgaagcagg 

ttttccagtg 

acttccaatt 

aaatgaactc 

aaaaagtata 

aagctgaaca 

tagaatggcc 

ggcagggcca 

ctggccatga 

cacctcacca 

ttgaggttag 

tcctcctttt 

gaacaagccc 

ctttgtggcc 

ccaggcctga 

caccttcctc 

ctgagctgca 

tgatggccag 
25 



gataaaaatc 
gacatgaaga 
gttacttatt 
ggattgatta 
gtaaatcata 
gacatcccag 
ggactcttct 
actttccatt 
gttccagtca 
agcagaagtt 
aaaggcgaaa 
ccagtactgg 
aaattgaact 
tggagtctag 
acactgtgaa 
cactttctga 
gaatacaatt 
aacaaacaat 
tctgctttca 
ctggggctcc 
tgggttggaa 
ccaacctcca 
acacctccag 
ccctctctgt 
atccactccc 
tggaaggttg 
agctccctca 
ctcctctgga 
atgcagtact 
accctgctgg 
acttctcctt 
tgcagagttt 



9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
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ttcacacttc 
gtgcttcttc 
tccatggccc 
tagcacagta 
acttgagaag 
aaacttgttt 
acaatggcta 
gtggcttagc 
tgtttgaagt 
agaggaatca 
agtcccgctg 
tcctgcccct 
ggaccctgat 
ggcacgattt 
tccctgtgct 
gcgctgcttg 
acgatctgga 
aggatagcat 
agaagtacag 
tgagcaccaa 
gccggccgct 
gaatgcagtg 
ccattataag 
ttcaggggga 
tcgataagga 
tgggcgcggg 
gtaggacagg 
cgttcggctg 
atcaggggat 
taaaaaggcc 
aaatcgacgc 



ttcatttcgg 
ctcctcaaat 
cctgcagcca 
tgaccaagtt 
aagagccaaa 
atctaccatg 
tgcagtaaaa 
caatgtggta 
gttgggaaat 
aaagggggtg 
tgtgtacgac 
ggctgcctta 
gctgctggct 
tggctttcct 
gcacgagatg 
ggatgagacc 
ggcttgcgtg 
cctggctgtg 
cccctgcgct 
cctgcaagag 
tcgagcagac 
aaaaaaatgc 
ctgcaataaa 
ggtgtgggag 
tccgtcgacc 
gcatgactat 
tgccggcagc 
cggcgagcgg 
aacgcaggaa 
gcgttgctgg 
tcaagtcaga 



tagatcttag 
actcctgcct 
gggccctgat 
gcacctatga 
tttgcattgt 
gctgttttta 
tcaagactgt 
cttcccacat 
ttctgtatac 
ggaggaagtt 
actggcaaca 
gggtgcgatc 
cagatgagga 
caagaggagt 
atccagcaga 
ctgctggata 
atccagggcg 
aggaagtact 
tgggaagtcg 
agcttgaggt 
atgataagat 
tttatttgtg 
caagttaaca 
gttttttaaa 
gatgcccttg 
cgtcgccgca 
gctcttccgc 
tatcagctca 
agaacatgtg 
cgtttttcca 
ggtggcgaaa 



025CIP7SEQ 
atgaggaacg 
gatacctcac 
gaacccggca 
atacacaaac 
caggaaatgg 
tggctgttag 
agatattgca 
tgtataagaa 
tcaagagggc 
aaaagaagag 
tgaggtcttt 
tgcctcagac 
gaatcagcct 
ttggcaacca 
tctttaacct 
agttttacac 
tgggcgtgac 
ttcagaggat 
tgagggctga 
ctaaggagta 
acattgatga 
aaatttgtga 
acaacaattg 
gcaagtaaaa 
agagccttca 
cttatgactg 
ttcctcgctc 
ctcaaaggcg 
agcaaaaggc 
taggctccgc 
cccgacagga 
Page 



List .tXt 
ttgaagttgt 
cccacctgcc 
ctgcttcaga 
aatgtgttgc 
tttagtaatt 
tagtggtaca 
acagactata 
atttggcaag 
gtttttgaca 
gcaggtgcaa 
gctaatcttg 
ccacagcctg 
gtttagctgc 
gtttcagaag 
gtttagcacc 
cgagctgtac 
cgagacccct 
caccctgtac 
gatcatgagg 
aaaagtctag 
gtttggacaa 
tgctattgct 
cattcatttt 
cctctacaaa 
acccagtcag 
tcttctttat 
actgactcgc 
gtaatacggt 
cagcaaaagg 
ccccctgacg 
ctataaagat 

26 



gcttctgcgt 
actgaatggc 
tgctgtttaa 
atccttcagc 
ctgccaatta 
ctgatgatga 
aaattcctct 
tttagagcaa 
actgtagaac 
gagagcttgc 
gtgctttgct 
ggcagcagga 
ctgaaggata 
gctgagacca 
aaggatagca 
cagcagctga 
ctgatgaagg 
ctgaaggaga 
agctttagcc 
agtcggggcg 
accacaacta 
ttatttgtaa 
atgtttcagg 
tgtggtaaaa 
ctccttccgg 
catgcaactc 
tgcgctcggt 
tatccacaga 
ccaggaaccg 
agcatcacaa 
accaggcgtt 



11340 
11400 
11450 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12/80 
12840 
12900 
12960 
13020 
13080 
13140 
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tccccctgga 
gtccgccttt 
cagttcggtg 
cgaccgctgc 
atcgccactg 
tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 
cgcggcgggt 
cgctcctttc 
tctaaatcgg 
aaaacttgat 



agctccctcg 
ctcccttcgg 
taggtcgttc 
gccttatccg 
gcagcagcca 
ttgaagtggt 
ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 
gtggtggtta 
gctttcttcc 
gggctccctt 
tagggtgatg 



tgcgctctcc 
gaagcgtggc 
gctccaagct 
gtaactatcg 
ctggtaacag 
ggcctaacta 
ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 
tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 
cgcgcagcgt 
cttcctttct 
tagggttccg 
gttcacgtag 



025CIP7SEQ 
tgttccgacc 

gctttctcaa 

gggctgtgtg 

tcttgagtcc 

gattagcaga 

cggctacact 

aaaaagagtt 

tgtttgcaag 

ttctacgggg 

attatcaaaa 

ctaaagtata 

tatctcagcg 

aactacgata 

acgctcaccg 

aagtggtcct 

agtaagtagt 

ggtgtcacgc 

agttacatga 

tgtcagaagt 

tcttactgtc 

attctgagaa 

taccgcgcca 

aaaactctca 

caactgatct 

gcaaaatgcc 

cctttttcaa 

tgaatgtatt 

acctgacgcg 

gaccgctaca 

cgccacgttc 

atttagtgct 

tgggccatcg 
Page 



List.tXt 

ctgccgctta 

tgctcacgct 

cacgaacccc 

aacccggtaa 

gcgaggtatg 

agaaggacag 

ggtagctctt 

cagcagatta 

tctgacgctc 

aggatcttca 

tatgagtaaa 

atctgtctat 

cgggagggct 

gctccagatt 

gcaactttat 

tcgccagtta 

t cy Ley L L Ly 

tcccccatgt 

aagttggccg 

atgccatccg 

tagtgtatgc 

catagcagaa 

aggatcttac 

tcagcatctt 

gcaaaaaagg 

tattattgaa 

tagaaaaata 

ccctgtagcg 

cttgccagcg 

gccggctttc 

ttacggcacc 

ccctgataga 
27 



ccggatacct 
gtaggtatct 
ccgttcagcc 
gacacgactt 
taggcggtgc 
tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 
ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
y LdLyycttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 
gcgcattaag 
ccctagcgcc 
cccgtcaagc 
tcgaccccaa 
cggtttttcg 



13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14450 
14520 
14580 
14540 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
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ccctttgacg 
actcaaccct 
ttggttaaaa 
gtttacaatt 
ggcctcttcg 
ggaggtactt 
ttattttcat 
ctccatcaaa 
gcaggtgcca 
agcaggatct 
gcataaatca 
atttatattg 
taatagtaat 
taacttacgg 
ataatgacgt 
gagtatttac 
ccccctattg 
ttacgggact 
atgcggtttt 
agtctccacc 
ccaaaatgtc 
gaggtctata 
cgctgttttg 
tctaggggct 
accggtgccg 
catgaccgag 
acgcaccctc 
ccgccacatc 
catcggcaag 
gagcgtcgaa 
ttcccggctg 



ttggagtcca 
atctcggtct 
aatgagctga 
tcccattcgc 
ctattacgcc 
ggagcggccg 
tacatctgtg 
acaaaacgaa 
gaacatttct 
atacattgaa 
atattggcta 
gctcatgtcc 
caattacggg 
taaatggccc 
atgttcccat 
ggtaaactgc 
acgtcaatga 
ttcctacttg 
ggcagtacat 
ccattgacgt 
gtaacaactc 
taagcagagc 
acctccatag 
cgagatctgc 
ccaccatccc 
tacaagccca 
gccgccgcgt 
gagcgggtca 
gtgtgggtcg 
gcgggggcgg 
gccgcgcagc 



cgttctttaa 
attcttttga 
tttaacaaaa 
cattcaggct 
agcccaagct 
ctctagaact 
tgttggtttt 
acaaaacaaa 
ctatcgatag 
tcaatattgg 
ttggccattg 
aatatgaccg 
gtcattagtt 
gcctggctga 
agtaacgcca 
ccacttggca 
cggtaaatgg 
gcagtacatc 
caatgggcgt 
caatgggagt 
cgccccattg 
tcgtttagtg 
aagacaccgg 
gatctaagta 
ctgacccacg 
cggtgcgcct 
tcgccgacta 
ccgagctgca 
cggacgacgg 
tgttcgccga 
aacagatgga 



025CIP7SEQ 
tagtggactc 
tttataaggg 
atttaacgcg 
gcgcaactgt 
accatgataa 
agtggatccc 
ttgtgtgaat 
ctagcaaaat 
gtaccgagct 
caattagcca 
catacgttgt 
ccatgttgac 
catagcccat 
ccgcccaacg 
atagggactt 
gtacatcaag 
cccgcctggc 
tacgtattag 
ggatagcggt 
ttgttttggc 
acgcaaatgg 
aaccgtcaga 
gaccgatcca 
agcttgcatg 
cccctgaccc 
cgccacccgc 
ccccgccacg 
agaactcttc 
cgccgcggtg 
gatcggcccg 
aggcctcctg 
Page 



List.tXt 

ttgttccaaa 
attttgccga 
aattttaaca 
tgggaagggc 
gtaagtaata 
ccggccgcaa 
cgatagtact 
aggctgtccc 
cttacgcgtg 
tattagtcat 
atctatatca 
attgattatt 
atatggagtt 
acccccgccc 
tccattgacg 
tgtatcatat 
attatgccca 
tcatcgctat 
ttgactcacg 
accaaaatca 
gcggtaggcg 
tcgcctggag 
gcctcccctc 
cctgcaggtc 
ctcacaagga 
gacgacgtcc 
cgccacaccg 
ctcacgcgcg 
gcggtctgga 
cgcatggccg 
gcgccgcacc 
28 



ctggaacaac 
tttcggccta 
aaatattaac 
gatcggtgcg 
ttaaggtacg 
taaaatatct 
aacatacgct 
cagtgcaagt 
ctagccctcg 
tggttatata 
taatatgtac 
gactagttat 
ccgcgttaca 
attgacgtca 
tcaatgggtg 
gccaagtccg 
gtacatgacc 
taccatggtg 
gggatttcca 
acgggacttt 
tgtacggtgg 
acgccatcca 
gaagctcgac 
ggccgccacg 
gacgaccttc 
cccgggccgt 
tcgacccgga 
tcgggctcga 
ccacgccgga 
agttgagcgg 
ggcccaagga 



15120 

15180 
15240 
15300 
15360 
15420 
15480 
15540 
15500 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
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qcccqcqtqq 


ttcctggcca 


ccqtcqqcqt 


025CIP7SEQ 
ctcgcccgac 


List.tXt 
caccagggca 


agqqtctqqq 

3 3 ^ Zi 3 


16980 


cagcgccgtc 


gtgctccccg 


qaqtqqaqqc 


qqccqaqcqc 


qccqqqqtgc 

13 13 13 13 13 13 


ccgccttcct 


17040 


ggagacctcc 


gcgccccgca 


acctcccctt 


ctacqaqcqg 

13 13 13 13 


ctcggcttca 


ccgtcaccgc 


17100 


cgacgtcgag 


gtgcccgaag 


gaccgcgcac 


ctggtgcatg 


acccgcaagc 


ccggtgcctg 


17160 


acgcccgccc 


cacgacccgc 


agcgcccgac 


cgaaaggagc 


gcacgacccc 


atggctccga 


17220 


ccgaaqccga 


cccgggcggc 


cccgccgacc 


ccgcacccgc 


ccccgaggcc 


caccgactct 


17280 


agagtcgggg 


cggccggccg 


cttcgagcag 


acatgataag 


atacattgat 


gagtttggac 


17340 


aaaccacaac 


tagaatgcag 


tgaaaaaaat 


gctttatttg 


tgaaatttgt 


gatgctattg 


17400 


ctttatttgt 


aaccattata 


agctgcaata 


aacaagttaa 


caacaacaat 


tgcattcatt 


17460 


ttatgtttca 


ggttcagggg 


gaggtgtggg 


aggtttttta 


aagcaagtaa 


aacctctaca 


17520 


aatgtggtaa 


aatcgataag 


gatcaattcg 


gcttcaggta 


ccgtcgacga 


tgtaggtcac 


17580 


ggtctcgaag 


ccgcggtgcg 


ggtgccaggg 


cgtgcccttg 


ggctccccgg 


gcgcgtactc 


17640 


cacctcaccc 


atctggtcca 


tcatgatgaa 


cgggtcgagg 


tggcggtagt 


tgatcccggc 


17700 


qaacqcqcqq 


cqcaccqqqa 


agccctcgcc 


ctcgaaaccg 


ctggqcgcgg 


tggtcacqqt 


17760 


gagcacggga 


cgtgcgacgg 


cgtcggcggg 


tgcggatacg 


cggggcagcg 


tcagcgggtt 


17820 


ctcgacggtc 


acggcgggca 


tgtcgacagc 


cgaattgatc 


cgtcgaccga 


tgcccttgag 


17880 


agccttcaac 


ccagtcagct 


ccttccggtg 


ggcgcggggc 


atgactatcg 


tcgccgcact 


17940 


tatgactgtc 


ttctttatca 


tgcaactcgt 


aggacaggtg 


ccggcagcgc 


tcttccgctt 


18000 


cr.t r.qr.t r.ar. 


tgantcgctg 


cxjc.t cggt eg 


ttcggctgcg 


gcgagcggta 


tcagctcact 


18050 


caaaggcggt 


aatacggtta 


tccacagaat 


caggggataa 


cgcaggaaag 


aacatg 


18116 


<210> 8 
<211> 17402 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Plasmid pOMlFN- 


■Ins-CMV-pur 


-attB 








<400> 8 
ggccgccacc 


gcggtggagc 


tccaattcgc 


cctatagtga 


gtcgtattac 


aattcactgg 


60 


ccgtcgtttt 


acaacgtcgt 


gactgggaaa 


accctggcgt 


tacccaactt 


aatcgccttg 


120 


cagcacatcc 


ccctttcgcc 


agctggcgta 


atagcgaaga 


ggcccgcacc 


gatcgccctt 


180 


cccaacagtt 


gcgcagcctg 


aatggcgaat 


gggacgcgcc 


ctgtagcggc 


gcattaagcg 


240 


cggcgggtgt 


ggtggttacg 


cgcagcgtga 


ccgctacact 


tgccagcgcc 


ctagcgcccg 


300 


ctcctttcgc 


tttcttccct 


tcctttctcg 


ccacgttcgc 


cggctttccc 


cgtcaagctc 


360 


taaatcgggg 


gctcccttta 


gggttccgat 


ttagtgcttt 


acggcacctc 


gaccccaaaa 


420 
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025CIP7SEQ 


List . tXt 








gggtgatggt 


tcacgtagtg 


ggccatcgcc 


ctgatagacg 


gtttttcgcc 


480 


ctttgacgtt 


ggagtccacg 


ttctttaata 


gtggactctt 


gttccaaact 


ggaacaacac 


540 


tcaaccctat 


ctcggtctat 


tcttttgatt 


tataagggat 


tttgccgatt 


tcggcctatt 


600 


ggttaaaaaa 


tgagctgatt 


taacaaaaat 


ttaacgcgaa 


ttttaacaaa 


atattaacgc 


650 


ttacaattta 


ggtggcactt 


ttcggggaaa 


tgtgcgcgga 


acccct attt 


gtttattttt 


720 


ctaaatacat 


tcaaatatgt 


atccgctcat 


gagacaataa 


ccctgataaa 


tgcttcaata 


780 


atattgaaaa 


aqqaaqaqta 


tgagtattca 


acatttccgt 


gtcgccctta 


tt cccttttt 


840 


tgcggcattt 


tgccttcctg 


tttttgctca 


cccagaaacg 


ctggtgaaag 


taaaagatgc 


900 


tgaagatcag 


ttgggtgcac 


gagtgggtta 


catcgaactg 


gatctcaaca 


gcggtaagat 


960 


ccttgagagt 


tttcgccccg 


aagaacgttt 


tccaatgatg 


agcactttta 


aagttctgct 


1020 


atgt ggcgcg 


gtattatccc 


gtattgacgc 


cgggcaagag 


caactcggtc 


gccgcataca 


1080 


ctattctcag 


aatgacttgg 


ttgagtactc 


accagtcaca 


gaaaagcatc 


ttacggatgg 


1140 


catgacagta 


agagaattat 


gcagtgctgc 


cataaccatg 


agtgataaca 


ctgcggccaa 


1200 


cttacttctg 


acaacgatcg 


qaqqaccqaa 


ggagctaacc 


gcttttttgc 


acaacatggg 


1260 


ggatcatgta 


actcgccttg 


atcqttqqqa 


accqqaqctq 


aatgaagcca 


taccaaacga 


1320 


cgagcgtgac 


accacgatgc 


ctgtagcaat 


ggcaacaacg 


ttgcgcaaac 


tattaactgg 


1380 


cgaactactt 


actctagctt 


cccggcaaca 


attaatagac 


tqqatqqaqq 


cggataaagt 


1440 


tgcaggacca 


cttctgcgct 


cggcccttcc 


ggctggctgg 


tttattgctg 


ataaatctgg 


1500 


agccggtgag 


cgtgggtctc 


gcggtatcat 


tgcagcactg 


gggccagatg 


gtaagccctc 


1560 


ccgtatcgta 


gttatctaca 


cqacqqqqaq 
^^^^^^ 


tcaggcaact 


atggatgaac 


gaaatagaca 


1620 


gatcgctgag 


ataggtgcct 


cactgattaa 


gcattggtaa 


ctgtcagacc 


aagtttactc 


1680 


atatatactt 


tagattgatt 


taaaacttca 


tttttaattt 


aaaaggatct 


aggtgaagat 


1740 


cctttttgat 


aatctcatga 


ccaaaatccc 


ttaacgtgag 


ttttcgttcc 


actgagcgtc 


1800 


agaccccgta 


gaaaagatca 


aaggatcttc 


ttgagatcct 


ttttttctgc 


gcgtaatctg 


1860 


ctgcttgcaa 


acaaaaaaac 


caccgctacc 


agcggtggtt 


tgtttgccgg 


atcaagagct 


1920 


accaactctt 


tttccgaagg 


taactggctt 


cagcagagcg 


cagataccaa 


atactgtcct 


1980 


tctagtgtag 


ccgtagttag 


gccaccactt 


caagaactct 


gtagcaccgc 


ctacatacct 


2040 


cgctctgcta 


atcctgttac 


cagtggctgc 


tgccagtggc 


gataagtcgt 


gtcttaccgq 


2100 


gttggactca 


agacgatagt 


taccggataa 


ggcgcagcgg 


tcgggctgaa 


cggggggttc 


2150 


gtgcacacag 


cccagcttgg 


agcgaacgac 


ctacaccgaa 


ctgagatacc 


tacagcgtga 


2220 


gctatgagaa 


agcgccacgc 


ttcccgaagg 


gagaaaggcg 


gacaggtatc 


cggtaagcgg 


2280 



Page 30 



wo 2006/093847 



PCT/US2006/006752 



cagggtcgga 


acaggagagc 


gcacgaggga 


025CIP7SEQ 
gctt ccaggg 


List . txt 
ggaaacgcct 


ggtatcttta 


2340 


tagtcctgtc 


gggtttcgcc 


acctctgact 


tgagcgtcga 


tttttgtgat 


gctcgtcagg 


2400 


crqcrcrccrcracrc 


ctatggaaaa 


acgccagcaa 


cgcggccttt 


ttacggttcc 


tggccttttg 


2460 


ctggcctttt 


gctcacatgt 


tctttcctgc 


gttatcccct 


gattctgtgg 


ataaccgtat 


2520 


taccgccttt 


gagtgagctg 


ataccgctcg 


ccgcagccga 


acgaccgagc 


gcagcgagtc 

3 3 3 ^ w 


2580 


agtgagcgag 


gaagcggaag 


agcgcccaat 


acgcaaaccg 


cctctccccg 


cgcgttggcc 


2640 


gattcattaa 


tgcagctggc 


acgacaggtt 


tcccgactgg 


aaagcgggca 


gtgagcgcaa 


2700 


cgcaattaat 


qtqaqttaqc 


tcactcatta 


ggcaccccag 


gctttacact 


ttatgcttcc 


2/60 


qqctcqtatq 


ttqtqtqqaa 


ttqtqaqcqq 

3 3 3 3 3 


ataacaattt 


cacacaggaa 


acagctatga 


2820 


ccatgattac 


gccaagctcg 


aaattaaccc 


tcactaaagg 


gaacaaaagc 


tgggtaccgg 


2880 


gccccccctc 


qactaqaqqq 


acagcccccc 


cccaaagccc 


ccaqqqatqt 


aattacgtcc 


2940 


ctcccccgct 


agggggcagc 


agcgagccgc 

3 3 3 J 


ccggggctcc 

3 3 3 3 


gctccggtcc 


ggcgctcccc 


3000 


ccgcatcccc 


gagccggcag 


cqtqcqqqqa 
^^^^^^^^^^ 


cagcccgggc 


acggggaagg 


tggcacggga 


3060 


tcgctttcct 


ctgaacgctt 


ctcgctgctc 


tttgagcctg 


cagacacctg 


gggggat acg 


3120 


qgqaaaaagc 


tttagqctqa 


aaqaqaqatt 


tagaatgaca 


gaatcataga 


acggcctggg 


3180 


ttgcaaagga 


gcacagtgct 


catccagatc 


caaccccctg 


ctatgtgcag 


ggtcatcaac 


3240 


cagcagccca 


ggctgcccag 


agccacatcc 


agcctggcct 


tgaatgcctg 


cagqqatqqq 


3300 


gcatccacag 


cctccttggg 


caacctgttc 


agtgcgtcac 


caccctctgg 


gggaaaaact 


3360 


gcctcctcat 


atccaaccca 


aacctcccct 


gtctcagtgt 


aaagccatt c 


cccct t gt cc 


3 42 0 


tatcaagggg 


gagtttgctg 


tgacattgtt 


ggtctggggt 


gacacatgtt 


tgccaattca 


3480 


gtgcatcacg 


gagaggcaga 


tcttggggat 


aaggaagtgc 


aggacagcat 


ggacgtggga 


3540 


catgcaggtg 


ttgagggctc 


tgggacactc 


tccaagtcac 


agcgttcaga 


acagccttaa 


3500 


ggataagaag 


ataggataga 


aggacaaaga 


gcaagttaaa 


acccagcatq 


qaqaggagca 


3660 


caaaaaggcc 


acagacactg 


ctggtccctg 


tgtctgagcc 


tgcatgtttg 


atggtgtctg 


3720 


gatgcaagca 


gaaqgqqtqq 

^ ^^^^ 33 


aagagcttgc 


ctggagagat 


acagctgggt 


cagtaggact 


3780 


gggacaggca 


gctggagaat 


tgccatgtag 


atgttcatac 


aatcgtcaaa 


tcatgaaggc 


3840 


tggaaaagcc 


ctccaagatc 


cccaagacca 


accccaaccc 


acccaccgtg 


cccactggcc 


3900 


atgtccctca 


gtgccacatc 


cccacagttc 


ttcatcacct 


ccagggacgg 


tgaccccccc 


3960 


acctccgtgg 


gcagctgtgc 


cactgcagca 


ccgctctttg 


gagaaggtaa 


atcttgctaa 


4020 


atccagcccg 


accctcccct 


ggcacaacgt 


aaggccatta 


tctctcatcc 


aactccagga 


4080 


cggagtcagt 


gaggatgggg 


ctctagtcga 


ggtcgacggt 


atcgataagc 


ttgattaggc 


4140 


agagcaatag 


gactctcaac 


ctcgtgagta 


tggcagcatg 


ttaactctgc 


actggagtcc 


4200 
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025CIP7SEQ 


List . tXt 








acaatctgcc 


ttgcacat ga 


gt ctt cgt gg 


gccaatattc 


cccaacggt t 


4250 


1^ 1^ 1^ 1^ LA. ^ \^ 


tt at ct t at c 


L- L- CJ. CJ. ^ \ C 


Ci CI CI \ ■ CI o ^ 


tttttaataa 


d. CJ. d. d. \^ \^ CL 


4320 


1 1 c a. a. c g t 


tt atctgt ct 


taccttagtg 


tcacgtttca 


tccctattcc 


cctttctcct 


43 80 


cct ccg't 


Q't acaca.Q'tQ' 


gt gcacactg 


gt t ct t ct gt 


tgatgttctg 


ctctgacagc 


4440 


ca.a.t Q't gggt 


a. a. agt t ct t c 


pt crpp apcrt cr 


tctgtgtt gt 


tttcacttca 


aaaacrcrcTPPP 

d y ^4 V— <• Vhrf' 


4500 


tgggctcccc 


1 1 gga.gct ct 


caggcatt t c 


cttaatcatc 


ac agt cac gc 


t ggcaggatt 


4550 


sgt ccct cct 


a.aaccttaga 


at gac ctgaa 


cgtgt get cc 


ct ett tgt ag 


t cagt gcagg 


4520 




cct caagat c 


agggt ccat c 


t cacccacag 


ggccatt ccc 


aagat gaggt 


4680 


gga.tggtt t a. 


ct ct cacaaa 


aagttttctt 


at gtt tggct 


agaaaggaga 


act cactgcc 


4740 


t acct gtga.a. 


tt cccct agt 


cctggtt ctg 


ct geeaetge 


t gcet gtgea 


gcct gt ccca 


4800 


t cr cr a cr cr cr cr cr c 


agcaactgct 


gt cacaaagg 


tgat cecace 


ct gt ct ccac 


t gaaatgacc 


4860 


t cagt gccac 


gt gtt gt at a 


gggt at aaag 


t a cgggaggg 


ggatgcccgg 


ct ccctt cag 


4920 


ggt tgcagag 


cagaagtgt c 


tgtgt at aga 


gt gtgt et t a 


at ct att aat 


gt aacagaac 


4980 


C-i Nw- L#« O CL^H O 


ct aat atttt 


crt crcrcrpt crcra 


at t crp ppat cr 


t crcrt acrcrcrap 

L#« y CL y CL 


acrcrppt crpt a 


50 40 


C-4 L#« CL ^rii' CL 


at PCTPPt at cr 


1 1 Pt craacrcrt 


at 1 1 crcrcraaa 

ci y y CL CL cii 


craaacrcrcrat t 

y CL cLy *y cl l#« 


t crcrcrcrcrat t cr 


5100 




cip 1 1" t aat t cr 


aat crcrpaaat 


papacrcraaacr 


pacrtt ptcrpt 


paappcrl" 1" crcr 


5160 


tt gtt t cage 


caatt ctt gc 


agccaaagag 


ccgggt gccc 


agcgat at aa 


t agt tgt cac 


5220 


ttatatctat 


at crcrat OriOri 


crcrcracrcrt acrcr 
y y y ay y L-ciy y 


crt crap pt cirici 


driCCriCCCt' C 

y NW Nw- L#« 


Orioct t pt crp 


5280 




acagt caeca 


cct cc age to 


cacacgagtc 


ccat cgt ggt 


ttaccaaaga 


5340 




tt tggaccag 


tt tggaaagt 


cacccgctga 


attgtgaggc 


t agatt aat a 


5400 


gagctgaaga 


gcaaatgtt c 


ccaacttgga 


gat act agt t 


ggt at t agt a 


tcagaggaac 


5460 


agggccat ag 


cacct ccatg 


ct att agatt 


ccggctggca 


tgtacttttc 


aagat gat tt 


5520 


gtaactaaca 


at ggcttatt 


gt get t gt ct 


taagt ctgtg 


tcctaatgta 


aat gtt cct t 


5580 


tggtttatat 


aacct t ct t g 


ecatt tgetc 


tt caggtgtt 


cttgcagaac 


ac tggct get 


56 40 


1 1 aat ct agt 


tt aactgt t g 


cttgattatt 


ct t agggat a 


agatctgaat 


aaactttttg 


5700 


tggctttggc 


a.gacttta.gc 


tt gggctt ag 


ct cccacatt 


agcttttgct 


gcct ttt ctg 


5760 


tgaagctatc 


aagatcctac 


tcaatgaeat 


tagctgggtg 


caggtgtacc 


aaatcctget 


58 20 


ctgtggaaca 


cattgt ctga 


tgataccgaa 


ggcaaacgtg 


aactcaaaga 


ggcacagagt 


5880 


taagaagaag 


tctgtgcaat 


tcagaggaaa 


agccaaagtg 


gccattagac 


acactttcca 


5940 


tgcagcattt 


gccagtaggt 


tt cat ataaa 


actacaaaat 


ggaataaacc 


actacaaatg 


6000 


ggaaaagcct 


gatactagaa 


tttaaatatt 


cacccaggct 


caaggggtgt 


ttcatggagt 


5050 
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aatatcactc 


tataaaagta 


gggcagccaa 


025CIP7SEQ 
ttattcacag 


LlSt.tXt 

acaaagcttt 


tttttttctg 


6120 


tgctgcagtg 


ctgtttttcg 


gctgatccag 


ggttacttat 


tgtgggtctg 


agagctgaat 


6180 


gatttctcct 


tgtgtcatgt 


tggtgaagga 


gatatggcca 


gggggagatg 


agcatgttca 


6240 


agaggaaacg 


ttgcattttg 


gtggcttggg 


agaaaggtag 


aacgatatca 


ggtccatagt 


6300 


gtcactaaga 


gatctgaagg 


atggttttac 


agaacagttg 


acttggctgg 


gtgcaggctt 


6360 


ggctgtaaat 


ggatggaagg 


atggacagat 


gggtggacag 


agatttctgt 


gcaggagatc 


6420 


atctcctgag 


ctcggtgctt 


gacagactgc 


agatccatcc 


cataaccttc 


tccagcatga 


6480 


gagcgcgggg 


agctttggta 


ctgttcagtc 


tgctgcttgt 


tgcttcctgg 


gtgcacagtg 


6540 


gtgattttct 


tactcacaca 


gggcaaaaac 


ctgagcagct 


tcaaagtgaa 


caggttgctc 


6600 


tcataggcca 


ttcagttgtc 


aagatgaggt 


ttttggtttc 


ttgttttgta 


aggtgggaag 


6660 


aagcactgaa 


ggatcagttg 


cgagggcagg 


ggtttagcac 


tgttcagaga 


agtcttattt 


6720 


taactcctct 


catgaacaaa 


aagagatgca 


ggtgcagatt 


ctggcaagca 


tgcagtgaag 


6780 


gagaaagccc 


tgaatttctg 


atatatgtgc 


aatgttgggc 


acctaacatt 


ccccgctgaa 


6840 


gcacagcagc 


tccagctcca 


tgcagtactc 


acagctggtg 


cagccctcgg 


ctccagggtc 


6900 


tgagcagtgc 


tgggactcac 


gaggttccat 


gtctttcaca 


ctgataatgg 


tccaatttct 


6960 


ggaatgggtg 


cccatccttg 


gaggtcccca 


aggccaggct 


ggctgcgtct 


ccgagcagcc 


7020 


cgatctggtg 


gtgagtagcc 


agcccatggc 


aggagttaga 


gcctgatggt 


ctttaaggtc 


7080 


ccttccaacc 


taagccatcc 


tacgattcta 


ggaatcatga 


cttqtqaqtq 


tgtattgcag 


7140 


aggcaatatt 


ttaaagttat 


aaatgttttc 


tccccttcct 


tgtttgtcaa 


agttatcttg 


7200 


atcgccttat 


caatgctttt 


ggagtctcca 


gtcatttttc 


ttacamcaaa 


aagaggagga 


7260 


agaatgaaga 


gaatcattta 


atttcttgat 


tgaatagtag 


gattcagaaa 


gctgtacgta 


7320 


atqccqtctc 


tttgtatcga 


qctqtaaqqt 


ttctcatcat 


ttatcaqcqt 


qgtacatatc 


7380 


agcacttttc 


catctgatgt 


ggaaaaaaaa 


atccttatca 


tctacagtct 


ctgtacctaa 


7440 


acatcgctca 


gactctttac 


caaaaaagct 


ataggtttta 


aaactacatc 


tgctgataat 


7500 


ttgccttgtt 


ttagctcttc 


ttccatatgc 


tgcgtttgtg 


agaggtgcgt 


ggatgggcct 


7560 


aaactctcag 


ctgctgagct 


tgatgggtgc 


ttaagaatga 


agcactcact 


gctgaaactg 


7620 


ttttcatttc 


acaggaatgt 


tttagtggca 


ttgtttttat 


aactacatat 


tcctcagata 


T o 


aatgaaatcc 


agaaataatt 


atgcaaactc 


actgcatccg 


ttgcacaggt 


ctttatctgc 


7740 


tagcaaagga 


aataatttgg 


ggatggcaaa 


aacattcctt 


cagacatcta 


tatttaaagg 


7800 


aatataatcc 


tggtacccac 


ccacttcatc 


cctcattatg 


ttcacactca 


gagatactca 


7860 


ttctcttgtt 


gttatcattt 


gatagcgttt 


tctttggttc 


tttgccacgc 


tctgggctat 


7920 


ggctgcacgc 


tctgcactga 


tcagcaagta 


gatgcgaggg 


aagcagcagt 


gagaggggct 


7980 



Page 33 



wo 2006/093847 



PCT/US2006/006752 









025CIP7SEQ 


List .tXt 






gccctcagct 


ggcacccagc 


cgctcagcct 


aggaggggac 


cttgcctttc 


caccagctga 


8040 


ggtgcagccc 


tacaagctta 


cacgtgctgc 


gagcaggtga 


gcaaagggag 


tcttcatggt 


8100 


gtgtttcttg 


ctgcccggaa 


gcaaaacttt 


actttcattc 


attccccttg 


aagaatgagg 


8150 


aatgtttgga 


aacggactgc 


ttt acgttca 


atttctctct 


tccctttaag 


gctcagccag 


8220 


gggccattgc 


tgaggacggc 


atcggggccc 


cctggaccaa 


atctgtggca 


cagatggttt 


8280 


cacttacatc 


agtggatgtg 


ggatctgcgc 


ctgtaatgtg 


tccttctgaa 


ggaaggaacg 


8340 


tgccttccaa 


gtgccagccc 


cacagccccc 


agcccctccc 


tgtgctgctc 


caattcatct 


8400 


cctcttcctc 


cttctccctt 


tgctgtttgt 


gctcgggtag 


aaatcatgaa 


gatttagaag 


8450 


agaaaacaaa 


ataactggag 


tggaaaccca 


ggtgatgcag 


ttcattcagc 


tgtcataggt 


8520 


ttgtcgttgc 


tataggtctg 


tatcagagat 


gctarcacca 


ctttgctgtc 


ggtgcttaac 


8b80 


tcgggtgaac 


tctccttcac 


tcgcatcatt 


tgcgggcctt 


atttacatcc 


ccagcatcca 


8640 


tcaccctctg 


ggaaaatggg 


cgcactggat 


ctctaatgga 


agactttccc 


tctttcagag 


8700 


cctgtgggat 


gtgcagtgac 


aagaaacgtg 


gaggggctga 


gcagcagcac 


tgcccccagg 


8760 


gagcaggagc 


ggatgccatc 


ggtggcagca 


tcccaaatga 


tgtcagcgga 


tgctgagcag 


8820 


gcagcggacg 


aacggacaga 


agcgatgcgt 


acaccttctg 


ttgacatggt 


atttggcagc 


8880 


gatttaacac 


tcgcttccta 


gtcctgctat 


tctccacagg 


ctgcattcaa 


atgaacgaag 


8940 


ggaagggagg 


caaaaagatg 


caaaatccga 


gacaagcagc 


agaaatattt 


cttcgctacg 


9000 


gaagcgtgcg 


caaacaacct 


tctccaacag 


caccagaaga 


gcacagcgta 


acctttttca 


9060 


agaccagaaa 


aggaaattca 


caaagcctct 


gtggatacca 


gcgcgttcag 


ctctcctgat 


9120 


agcagatttc 


ttgtcaggtt 


gcgaatgggg 


tatggtgcca 


ggaggtgcag 


ggaccatatg 


9180 


atcatataca 


gcacagcagt 


cattgtgcat 


gtattaatat 


atattgagta 


gcagtgttac 


9240 


tttgccaaag 


caatagttca 


gagatgagtc 


ctgctgcata 


cctctatctt 


aaaactaact 


9300 


tataaatagt 


aaaaccttct 


cagttcagcc 


acgtgctcct 


ctctgtcagc 


accaatggtg 


9360 


cttcgcctgc 


acccagctgc 


aaggaatcag 


cccgtgatct 


cattaacact 


cagctctgca 


9420 


ggataaatta 


gattgttcca 


ctctcttttg 


ttgttaatta 


cgacggaaca 


attgttcagt 


9480 


gctgatggtc 


ctaattgtca 


gctacagaaa 


acgtctccat 


gcagttcctt 


ctgcgccagc 


Q A n 

y o 41 u 


aaactgtcca 


ggctatagca 


ccgtgatgca 


tgctacctct 


cactccatcc 


ttcttctctt 


9600 


tcccaccagg 


gagagctgtg 


tgttttcact 


ctcagccact 


ctgaacaata 


ccaaactgct 


9660 


acgcactgcc 


tccctcggaa 


agagaatccc 


cttgttgctt 


ttttatttac 


aggatccttc 


9720 


ttaaaaagca 


gaccatcatt 


cactgcaaac 


ccagagcttc 


atgcctctcc 


ttccacaacc 


9780 


gaaaacagcc 


ggcttcattt 


gtctttttta 


aatgctgttt 


tccaggtgaa 


ttttggccag 


9840 
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cgtgttggct 
attgcctctt 
cccttttctt 
ggaggcacca 
cctgaagctt 
taaagcccag 
agcagcactt 
gatgtaaaca 
ttctgtcaat 
gctgagttaa 
gctgtaacct 
cataagctgt 
tcatctttgc 
ctgccagctc 
gcagcaccgc 
tccctgttgc 
ctcacccaga 
tccagccatt 
tcagcaaaag 
ttccttcttt 
aacaattcag 
gccaggttca 
gaggagattc 
ttcagataga 
atccccatac 
ctgtgcccta 
cttgcagtcc 
tcccctgcag 
tgctacctga 
gatacgtttt 
tttgggcagg 
tggctcaggt 



gagatccagg 
tctggggttt 
agggtggctg 
gtgggggtgt 
gcagtccaag 
ataggactca 
gtttgcagcc 
gcgaaggtta 
cttacaaaac 
tgagacttgg 
gtctccccat 
aggatgcagc 
cgctcagtag 
tgcttgtttg 
tctgggtgac 
tgctgctcca 
cacctgggtt 
acttgtagaa 
ctgtgggaaa 
aacaaaaatt 
gt Lgaaaggg 
aagatggctc 
acagccactc 
atgttatcct 
tctccccttc 
agcagcccgc 
agcctgggac 
ggtacaaagt 
atcaagctct 
aatggtatca 
aggtgaggga 
cagatagtgg 



agcacgtgtc 
ccaagagggg 
ctgggcagca 
gttttgtgca 
gctttgtctg 
gaaatgtagt 
agtcctcccc 
attacctcct 
agaaccgaga 
cagctcgctg 
cgccttcaca 
tgcccaggga 
cgtgcagcaa 
tcaataagca 
ctgagtggcc 
qggactCQtq 
gcaatggctg 
atgtgctgtg 
gctctgaggc 
atccttagga 
ctctctggag 
aggatgctgt 
tgcttcccgt 
ttagctcagc 
tctggggtga 
tggcctctgc 
atttaattca 
tctgcacggg 
atttaataag 
gtgattttta 
cgggtctgcc 
tcactctgtg 



025CIP7SEQ 
agctttctgc 

gggagacttt 

gagtggctct 

ggggggaagc 

tgtacccagt 

cattccagcc 

aaaatgcaca 

tgtcaaaaac 

ggcagcgagc 

tgcagagatg 

ccgcagtgct 

tcaagagact 

ttactcatcc 

tttcttcatt 

acctggaacc 

tqctqctgga 

cagcgtgctc 

gaagcccttt 

tgctttcttg 

gagagcactg 

gtttccagcc 

gtgccttcct 

gccactcatg 

cttccctata 

aggccgtgtc 

tggatgtgtg 

tcaccagggt 

gtcctttcgg 

ttcataagca 

tttgctttgt 

gttggctctg 

gccagaagaa 
Page 



List .tXt 
tctcattgct 

gcgcggggat 

gggtcactgt 

attcacagaa 

gaaatccttc 

cccctcttcc 

gacctcgccg 

actttgtggt 

actgaagagc 

atccctgtgc 

gtcctggaca 

tttcctaagg 

caactatact 

ttgcctctaa 

cgaggggcac 

tgggqqgaaq 

ttcttggtat 

gtatctcttt 

ggtcgtggag 

tgcaagcat t 

tgactactgc 

gattatctgt 

gagaggaata 

accccatgag 

ccccagcccc 

cctatatgtc 

aatgtggaac 

ttcaggaaaa 

catggatgtg 

tgcttacttc 

cagtgatttc 

ggacaaagat 
35 



cctgttctgc 
gagataatgc 
ggcaccaatg 
tggggctgat 
ctctgttaca 
tcagatctgg 
agtggaggga 
ccatagatgt 
gtgttcccat 
ttcatgggag 
cctcaccctc 
ctcttaggac 
gaatgggttt 
gtttctctca 
agccaccacc 
catqaaqttc 
gcagattgtt 
ctgtggccct 
gaattgtatg 
gtgcacat aa 
tcgaagcaag 
gccaccaatg 
ttcccttata 
ggagctgcag 
ccttcccacc 
aatgcctgtc 
tgtgtcatct 
ccttcactgg 
ttttcctaga 
aaacagtgcc 
tccaggcgtg 
ggaaattgca 



9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
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gattgagtca 
cgaccactta 
aagcccagga 
ctaccagcta 
aatattagtt 
agaaattcct 
cagcctgctg 
tgcctttgct 
cctgtagcca 
aattaagtgg 
ggtagctaag 
tgtgctgttc 
ccctgtgctg 
caagggagat 
tgtgcacctc 
agcaggagct 
attcataagg 
tcccttaatt 
aaacccatat 
gaacgtgcca 
ctgtgaaagg 
ccaaaacagc 
gaaggctgga 
tttgcaggag 
ttccccgaac 
tcgcttaact 
tttttagtgg 
acttaagcag 
taacttcata 
ctggtgacaa 
aactgcatgg 



cgttaagcag 
ttgttgtttt 
atgtttggga 
agaaaaaagt 
tggaaaaaaa 
tgggtaagag 
accctctgca 
tgcccacaga 
aacagcgatt 
attgcagcag 
ggctgagcaa 
acagaggcag 
agaatggagt 
cggtgctcct 
cctcctgcta 
gtagtgcact 
gcaatatatc 
tatagaaaca 
ttccatgtag 
gtgctcatct 
aaccatggta 
agagctccag 
ggtgagagac 
gaaaggacaa 
ttctgtgtca 
gtgactaggt 
tatttgacag 
tgaaaccgcc 
ggttagatac 
agcctccctg 
aaagcagagc 



gcatcttgga 
ccccttttac 
aaagtgtagt 
cctacctttg 
caccaccacc 
agaaaggatg 
gttcaggacc 
tctctgccct 
agccataagc 
gagatcagtg 
cctgtcctac 
ggccagctgg 
gatttctggg 
catgcagtgc 
atgcagccgt 
cctgtaggtg 
ttgaggctgc 
aacacagcag 
aaatgtaacc 
cccatggcaa 
agaaacttcg 
acctcctctt 
agagcctgtc 
ctgtactttc 
tttgttctta 
ctgtaaataa 
catttacctt 
aagaaaccct 
ggtgctgcca 
gcctgtgctt 
tgcaggcacg 



025CIP7SEQ 
gtgatttgag 
aacagaagtt 
taaatgtttt 
gtatggtagt 
accagaaact 
tcgtatactg 
atgaaacgtg 
tgtgctgact 
tgcacctgac 
gcagggttgc 
agcacaagcc 
agctggagga 
tgctgttcct 
caaaactcgt 
gagcttatga 
ctagggaaaa 
gccaaatctt 
aataattatt 
tatatacaca 
aatactacag 
gttaaaggta 
aggaaagagc 
ccagttttcc 
aggcatagct 
agatgctttg 
gtgctccctg 
gctttcaagt 
tccttttatc 
gcttcacctg 
ttacctagag 
atggtgctga 

Page 



List .tXt 

gcagtttcat 

ttcatcaaaa 

gtaattcatt 

cctgcagaga 

gtaatggaaa 

gccaagtcct 

gcactgtaag 

cctgcacaca 

tttgaggatt 

agatgaaatc 

aaaccagcca 

ggttgtgctg 

gtggcttgca 

gtttgatgca 

aggcaatgag 

tctctggttc 

tctgaaatat 

ccaatgcctc 

cagccatgct 

gtattctcac 

tggctgcaaa 

cacttggaga 

tgtctctatt 

ggtgccctca 

gcagaacact 

ctgataaggt 

cttctaccaa 

aagctagtgc 

gcagtggttg 

gtgaatatcc 

gccttagctg 

36 



gaaagagcta 
taacgtggca 
tgtcggagtg 
atacaacatc 
atgtaaacca 
gcccagctgt 
acgtgtcccc 
agagcatttc 
aagagtttgc 
cttttctagg 
agggttttcc 
ggacccttct 
ctgagcagct 
gaaagatgga 
ccctcagtgc 
ccagggatgc 
tcatgcgtgt 
ccctcgaagg 
gcatccttca 
tatgttggac 
actactcata 
gggatggtgt 
ttctgaaacg 
cgtaaataag 
ttgagtcaat 
tcaagtgaca 
gctcttctat 
taaataccat 
gtcagttctg 
aagaatgcag 
cttcctgctg 



11820 
11880 
11940 
12000 
12050 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13520 
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ggagatgtgg 
ctatttaggg 
tttgtttggt 
ccagccaaaa 
aatattacca 
aacagttcgg 
agcagagcac 
gccttactgg 
ctgcctcaga 
agaatcagcc 
tttggcaacc 
atctttaacc 
aagttttaca 
gtgggcgtga 
tttcagagga 
gtgagggctg 
tctaaggagt 
tacattgatg 
gaaatttgtg 
aacaacaatt 
agcaagtaaa 
cactaacata 
tccccagtgc 
cgtgctagcc 
tcattggtta 
atcataatat 
tattgactag 
agttccgcgt 
gcccattgac 
gacgtcaatg 
atatgccaag 
cccagtacat 



atgcagagac 
ttctaccaga 
tttgaccact 
tcaattgcct 
gcctcattgt 
acgccacgtt 
cggcagccgc 
tggctctcct 
cccacagcct 
tgtttagctg 
agtttcagaa 
tgtttagcac 
ccgagctgta 
ccgagacccc 
tcaccctgta 
agatcatgag 
aaaaagtcta 
agtttggaca 
atgctattgc 
gcattcattt 
acctctacaa 
cgctctccat 
aagtgcaggt 
ctcgagcagg 
tatagcataa 
gtacatttat 
ttattaatag 
tacataactt 
gtcaataatg 
ggtggagtat 
tccgccccct 
gaccttacgg 



gaatgaagga 
gtccttaaga 
gagagcatgt 
gggagacgca 
gccgctgaca 
tgtatatatt 
ctgcagagcc 
ggtgctgagc 
gggcagcagg 
cctgaaggat 
ggctgagacc 
caaggatagc 
ccagcagctg 
tctgatgaag 
cctgaaggag 
gagctttagc 
gagtcggggc 
aaccacaact 
tttatttgta 
tatgtttcag 
atgtggtaaa 
caaaacaaaa 
gccagaacat 
atctatacat 
atcaatattg 
attggctcat 
taatcaatta 
acggtaaatg 
acgtatgttc 
ttacggtaaa 
attgacgtca 
gactttccta 



025CIP7SEQ 
cctgtccctt 

ggtttttttt 

gacacttgtc 

gaccattacc 

gattcagctg 

tgcaggcagc 

gggcagtacc 

tgcaagagca 

aggaccctga 

aggcacgatt 

atccctgtgc 

agcgctgctt 

aacgatctgg 

gaggatagca 

aagaagtaca 

ctgagcacca 

ggccggccgc 

agaatgcagt 

accattataa 

gttcaggggg 

atcgataccg 

cgaaacaaaa 

ttctctatcg 

tgaatcaata 

gctattggcc 

gtccaatatg 

cggggtcatt 

gcccgcctgg 

ccatagtaac 

ctgcccactt 

atgacggtaa 

cttggcagta 

Page 



List .tXt 
actcccctca 

ttttttggtc 

tcaagctatt 

tggaggtcag 

gctgctccgt 

ctcgggggga 

tcaccatggc 

gctgctctgt 

tgctgctggc 

ttggctttcc 

tgcacgagat 

gggatgagac 

aggcttgcgt 

tcctggctgt 

gcccctgcgc 

acctgcaaga 

ttcgagcaga 

gaaaaaaatg 

gctgcaataa 

agg Lg Lggga 

tcgacctcga 

caaactagca 

ataggtaccg 

ttggcaatta 

attgcatacg 

accgccatgt 

agttcatagc 

ctgaccgccc 

gccaataggg 

ggcagtacat 

atggcccgcc 

catctacgta 
37 



gcattctgtg 
caaaagtctg 
aaccaagtgt 
gacctcaata 
gttccagtcc 
ccatctcagg 
tttgaccttt 
gggctgcgat 
tcagatgagg 
tcaagaggag 
gatccagcag 
cctgctggat 
gatccagggc 
gaggaagtac 
ttgggaagtc 
gagcttgagg 
catgataaga 
ctttatttgt 
acaagttaac 
ggttttttaa 
ctagagcggc 
aaataggctg 
agctcttacg 
gccatattag 
ttgtatctat 
tgacattgat 
ccatatatgg 
aacgaccccc 
actttccatt 
caagtgtatc 
tggcattatg 
ttagtcatcg 



13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
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ctattaccat 
cacggggatt 
atcaacggga 
ggcgtgtacg 
ggagacgcca 
cctcgaagct 
ggtcggccgc 
aggagacgac 
gtcccccggg 
accgtcgacc 
cgcgtcgggc 
tggaccacgc 
gccgagttga 
caccggccca 
ggcaagggtc 
gtgcccgcct 
ttcaccgtca 
aagcccggtg 
ccccatggct 
ggcccaccga 
tgatgagttt 
ttgtgatgct 
caattgcatt 
gtaaaacctc 
acgatgtagg 
ccgggcgcgt 
tagttgatcc 
gcggtggtca 
agcgtcagcg 
ccgatgccct 
at cgtcgccg 



ggtgatgcgg 
tccaagtctc 
ctttccaaaa 
gtgggaggtc 
tccacgctgt 
cgactctagg 
cacgaccggt 
cttccatgac 
ccgtacgcac 
cggaccgcca 
tcgacatcgg 
cggagagcgt 
gcggttcccg 
aggagcccgc 
tgggcagcgc 
tcctggagac 
ccgccgacgt 
cctgacgccc 
ccgaccgaag 
ctctagagtc 
ggacaaacca 
attgctttat 
cattttatgt 
tacaaatgtg 
tcacggtctc 
actccacctc 
cggcgaacgc 
cggtgagcac 
ggttctcgac 
tgagagcctt 
cacttatgac 



ttttggcagt 
caccccattg 
tgtcgtaaca 
tatataagca 
tttgacctcc 
ggctcgagat 
gccgccacca 
cgagtacaag 
cctcgccgcc 
catcgagcgg 
caaggtgtgg 
cgaagcgggg 
gctggccgcg 
gtggttcctg 
cgtcgtgctc 
ctccgcgccc 
cgaggtgccc 
gccccacgac 
ccgacccggg 
ggggcggccg 
caactagaat 
ttgtaaccat 
ttcaggttca 
gtaaaatcga 
gaagccgcgg 
acccatctgg 
gcggcgcacc 
gggacgtgcg 
ggtcacggcg 
caacccagtc 
tgtcttcttt 



025CIP7SEQ 
acatcaatgg 
acgtcaatgg 
actccgcccc 
gagctcgttt 
atagaagaca 
ctgcgatcta 
tcccctgacc 
cccacggtgc 
gcgttcgccg 
gtcaccgagc 
gtcgcggacg 
gcggtgttcg 
cagcaacaga 
gccaccgtcg 
cccggagtgg 
cgcaacctcc 
gaaggaccgc 
ccgcagcgcc 
cggccccgcc 
gccgcttcga 
gcagtgaaaa 
tataagctgc 
gggggaggtg 
taaggatcaa 
tgcgggtgcc 
tccatcatga 
gggaagccct 
acggcgtcgg 
ggcatgtcga 
agctccttcc 
atcatgcaac 

Page 



List . tXt 
gcgtggatag 
gagtttgttt 
attgacgcaa 
agtgaaccgt 
ccgggaccga 
agtaagcttg 
cacgcccctg 
gcctcgccac 
actaccccgc 
tgcaagaact 
acggcgccgc 
ccgagatcgg 
tggaaggcct 
gcgtctcgcc 
aggcggccga 
ccttctacga 
gcacctggtg 
cgaccgaaag 
gaccccgcac 
gcagacatga 
aaatgcttta 
aataaacaag 
tgggaggttt 
ttcggcttca 
agggcgtgcc 
tgaacgggtc 
cgccctcgaa 
cgggtgcgga 
cagccgaatt 
ggtgggcgcg 
tcgtaggaca 

38 



cggtttgact 
tggcaccaaa 
atgggcggta 
cagatcgcct 
tccagcctcc 
catgcctgca 
acccctcaca 
ccgcgacgac 
cacgcgccac 
cttcctcacg 
ggtggcggtc 
cccgcgcatg 
cctggcgccg 
cgaccaccag 
gcgcgccggg 
gcggctcggc 
catgacccgc 
gagcgcacga 
ccgcccccga 
taagatacat 
tttgtgaaat 
ttaacaacaa 
tttaaagcaa 
ggtaccgtcg 
cttgggctcc 
gaggtggcgg 
accgctgggc 
tacgcggggc 
gatccgtcga 
gggcatgact 
ggtgccggca 



15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
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025CIP7SEQ List.tXt 
gc 17402 

<210> 9 

<211> 5172 

<212> DNA 

<213> Artifi cial Sequence 

<220> 

<223> Plasmid pRSV-Int 

<400> 9 



ctgcattaat 


gaatcggcca 


acgcgcgggg 


agaggcggtt 


tgcgtattgg 


gcgctcttcc 


60 


gcttcctcgc 


tcactgactc 


gctgcgctcg 


gtcgttcggc 


tgcggcgagc 


ggtatcagct 


120 


cactcaaagg 


cggtaatacg 


gttatccaca 


gaatcagggg 


ataacgcagg 


aaagaacatg 


180 


tgagcaaaag 


gccagcaaaa 


ggccaggaac 


cgtaaaaagg 


ccgcgttgct 


ggcgtttttc 


240 


cataggctcc 


gcccccctga 


cgagcatcac 


aaaaatcgac 


gctcaagtca 


gaggtggcga 


300 


aacccgacag 


gactataaag 


ataccaggcg 


tttccccctg 


gaagctccct 


cgtgcgctct 


360 


cctgttccga 


ccctgccgct 


taccggatac 


ctgtccgcct 


ttctcccttc 


gggaagcgtg 


420 


gcgctttctc 


aatgctcacg 


ctgtaggtat 


ctcagttcgg 


tgtaggtcgt 


tcgctccaag 


480 


ctgggctgtg 


tgcacgaacc 


ccccgttcag 


cccgaccgct 


gcgccttatc 


cggtaactat 


540 


cgtcttgagt 


ccaacccggt 


aagacacgac 


ttatcgccac 


tggcagcagc 


cactggtaac 


500 


aggattagca 


gagcgaggta 


tgtaggcggt 


gctacagagt 


tcttgaagtg 


gtggcctaac 


660 


tacggctaca 


ctagaaggac 


agtatttggt 


atctgcgctc 


tgctgaagcc 


agttaccttc 


720 


ggaaaaagsg 


t tggtagctc 


ttgatccggc 


aa a caa acca 


ccgct ggt ag 


cggtggtttt 


780 


tttgtttgca 


agcagcagat 


tacgcgcaga 


aaddddyyat 


ctcaayddya 


tcctttgatc 


840 


ttttctacgg 


ggtctgacgc 


tcagtggaac 


gaaaactcac 


gttaagggat 


tttggtcatg 


900 


agattatcaa 


aaaggatctt 


cacctagatc 


cttttaaatt 


aaaaatgaag 


ttttaaatca 


960 


atctaaagta 


tatatgagta 


aacttggtct 


gacagttacc 


aatgcttaat 


cagtgaggca 


1020 


cctatctcag 


cgatctgtct 


atttcgttca 


tccatagttg 


cctgactccc 


cgtcgtgtag 


1080 


ataactacga 


tacgggaggg 


cttaccatct 


ggccccagtg 


ctgcaatgat 


accgcgagac 


1140 


ccacgctcac 


cggctccaga 


tttatcagca 


ataaaccagc 


cagccggaag 


ggccgagcgc 


1200 


agaagtggtc 


ctgcaacttt 


atccgcctcc 


atccagtcta 


ttaattgttg 


ccgggaagct 


1260 


agagtaagta 


gttcgccagt 


taatagtttg 


cgcaacgttg 


ttgccattgc 


tacaggcatc 


1320 


gtggtgtcac 


gctcgtcgtt 


tggtatggct 


tcattcagct 


ccggttccca 


acgatcaagg 


1380 


cgagttacat 


gatcccccat 


gttgtgcaaa 


aaagcggtta 


gctccttcgg 


tcctccgatc 


1440 


gttgtcagaa 


gtaagttggc 


cgcagtgtta 


tcactcatgg 


ttatggcagc 


actgcataat 


1500 


tctcttactg 


tcatgccatc 


cgtaagatgc 


ttttctgtga 


ctggtgagta 


ctcaaccaag 


1560 
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025CIP7SEQ 


List . tXt 








aatagtgtat 


gcggcgaccg 


agttgctct t 


gc ccggcgt c 


aatacgggat 


1620 


Scitscccrccfc 


cacatagcag 


aactttaaaa 


gtgctcatc a 


tt ggaaaacg 


ttcttcgggg 


1680 


cgaaaactct 


caaggatctt 


accgctgttg 


agat cc agt t 


eg at gt aac c 


cactcgtgca 


1740 


cccaactgat 


cttcagcatc 


ttttactttc 


accagc gt 1 1 


ct gggt gage 


aaaaacagga 


1800 




ccgcaaaaaa 


gggaat aagg 


gcgaca egg a 


aatgttgaat 


actcatactc 


186 0 


ttcctttttc 


aatattattg 


aagcatttat 


cagggttatt 


gtctcatgag 


cggatacata 


1920 




tttagaaaaa 


'taaacaaa'ta 


ggggtt CCgC 


gcacatttcc 


ccgaaaagtg 


1980 


ccacctgacg 


tcgacggatc 


gggagatctc 


ccgatcccct 


atggtcgact 


ctcagtacaa 


2040 


tctgctctga 


tgccgcatag 


ttaagccagt 


atctgctccc 


tgcttgtgtg 


ttggaggtcg 


2100 


ct gagt agt g 


cgcgagcaaa 


atttaagcta 


caacaaggca 


aggcttgacc 


gacaattgca 


2160 


tgaagaatct 


gcttagggtt 


aggcgttttg 


cgctgcttcg 


cgatgtacgg 


gccagatata 


2220 


cgcgtgctag 


gggt ct agga 


tcgattctag 


gaattctcta 


gccgcggtct 


agggatcccg 


2280 


gcgcgtatgg 


tgcactctca 


gtacaatctg 


ctctgatgcc 


gcatagttaa 


gccagtatct 


2340 


gctccctgct 


tgtgtgttgg 


aggtcgctga 


gtagtgcgcg 


agcaaaattt 


aagctacaac 


2400 


aaggcaaggc 


ttgaccgaca 


attgcatgaa 


gaatctgctt 


agggttaggc 


gttttgcgct 


2460 


gctt cgcgat 


gtacgggcca 


gatatacgcg 


tatctgaggg 


ga ct agggt g 


tgtttaggcg 


2520 


aaaagcgggg 


cttcggttgt 


acgcggttag 


gagtcccctc 


aggatatagt 


agtttcgctt 


2580 


ttgcataggg 


agggggaaat 


gtagtcttat 


gcaatacact 


tgtagtcttg 


caacatggta 


2640 


acgatgagtt 


agcaacatgc 


cttacaagga 


gagaaaaagc 


accgtgcatg 


ccgattggtg 


2700 


gaagtaaggt 


ggtacgatcg 


tgccttatta 


ggaaggcaac 


agacaggtct 


gacatggatt 


2760 


ggacgaacca 


ctgaattccg 


cattgcagag 


ataattgtat 


ttaagtgcct 


agctcgatac 


2820 


aataaacgcc 


atttgaccat 


tcaccacatt 


ggtgtgcacc 


tccaagcttg 


catgcctgca 


2880 


ggt a ccggt c 


cggaattccc 


gggtcgacga 


gctcactagt 


cgt agggt eg 


ccgacatgac 


2940 


acaaggggtt 


crt era c c ocrcrcr 


tggacacgta 


cgcgggtgct 


tacgaccgtc 


agtcgcgcga 


3000 


gcgcgagaat 


tcgagcgcag 


caagcccagc 


gacacagcgt 


agcgccaacg 


aagacaaggc 


3060 


ggccgacctt 


cagcgcgaag 


tcgagcgcga 


cgggggccgg 


ttcaggttcg 


tcgggcattt 


3120 


cagcgaagcg 


ccgggcacgt 


cggcgttcgg 


gacggcggag 


cgcccggagt 


tcgaacgcat 


318 0 


cctgaacgaa 


tgccgcgccg 


ggcggctcaa 


catgatcatt 


gtctatgacg 


tgtcgcgctt 


3240 


ctcgcgcctg 


aaggtcatgg 


acgcgattcc 


gattgtctcg 


gaattgctcg 


ccctgggcgt 


3300 


gacgattgtt 


tccactcagg 


aaggcgtctt 


ccggcaggga 


aacgtcatgg 


acctgattca 


3360 


cctgattatg 


cggctcgacg 


cgtcgcacaa 


agaatcttcg 


ctgaagtcgg 


cgaagattct 


3420 
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cgacacgaag 


aaccttcagc 


gcgaattggg 


025CIP7SEQ 
cgggtacgtc 


List .tXt 
qQCQcrqaagQ 


cgccttacgg 


3480 


cttcgagctt 


gtttcqgaga 


cqaaqgaqat 


cacgcgcaac 


qgccqaatgg 

3 3^ ^3 33 


tcaatgtcgt 


3540 


catcaacaag 


cttgcgcact 


cgaccactcc 


ccttaccgga 


cccttcgagt 


tcgagcccga 


3600 


cgtaatccgg 


tggtggtggc 


gtgagatcaa 


gacgcacaaa 


caccttccct 


tcaagccggg 

3 3 3 3 


3660 


cagtcaagcc 


gccattcacc 


cgggcagcat 


cacggggctt 


tgtaagcgca 


tggacgctga 


3720 


cgccgtgccg 


acccggggcg 


agacgattgg 


gaagaagacc 


gcttcaagcg 


cctgggaccc 


3780 


ggcaaccgtt 


atgcgaatcc 


ttcgggaccc 


gcgtattgcg 

3 3 — . — . ^ — ^ 


ggcttcgccg 

33 — — ^ — — ^ 


ctgaqgtqat 

3 3 3 J 


3840 


ctacaagaag 


aagccggacg 


gcacgccgac 


cacgaagatt 


gagggttacc 


gcattcagcg 


3900 


cgacccgatc 


acgctccggc 


cggtcqaqct 


tqattgcqga 


ccgatcatcg 


agcccgctga 


3960 


gtggtatgag 


cttcaggcgt 


ggttggacgg 


cagggggcgc 


ggcaaggggc 


tttcccgggg 


4020 


gcaagccatt 


ctgtccgcca 


tggacaagct 


gtactgcgag 


tgtggcgccg 


tcatgacttc 


4080 


gaagcgcggg 


gaagaatcga 


tcaaggactc 


ttaccgctgc 


cgtcgccgga 

3 3 3 3 


aggtggtcga 


4140 


cccgtccgca 


cctgggcagc 


acgaaggcac 


gtgcaacgtc 


agcatggcgg 


cactcgacaa 


4200 


gttcgttgcg 


gaacgcatct 


tcaacaagat 


caggcacgcc 


gaaggcgacg 


aagagacgtt 


4260 


gqcqcttctq 


tqqqaaqccq 


cccgacgctt 


cggcaagctc 


actgaggcqc 


ctqagaagag 


4320 


cggcgaacgg 


gcgaaccttg 


ttgcggagcg 


cgccgacgcc 


ctgaacgccc 


ttgaagagct 


4380 


gtacgaagac 


cgcgcgqcaq 


gcgcgtacga 


cggacccgtt 


ggcaggaagc 


acttccggaa 


4440 


gcaacaggca 


qcqctqacqc 


tccggcagca 


aqqqqcqqaa 


qaqcgqcttq 


ccgaacttga 


4500 


agccgccgaa 


gccccgaagc 


ttccccttga 


ccaatggttc 


cccgaagacg 


ccgacgctga 


4560 


cccgaccggc 


cctaagtcgt 


qqt qqaqqcq 


cgcy t. c:agt a 


gacgacaagc 


gcgtgttcgt 


4520 


cgggctcttc 


gtagacaaga 


tcgttgtcac 


gaagtcgact 


acgqgcaqgg 


qgcaggqaac 


4680 


gcccatcgag 


aagcgcgctt 


cgatcacgtg 


ggcgaagccg 


ccgaccgacg 


acgacgaaga 


4740 


cgacgcccag 


qacqqcacqq 


aagacgtagc 


gqcgtagcga 

3 3 3 _J 


gacacccgga 


tccctcgagg 


4800 


ggccctattc 


tatagtgtca 


cctaaatgct 


agagctcgct 


gatcagcctc 


gactgtgcct 


4850 


tctagttgcc 


agccatctgt 


tgtttgcccc 


tcccccgtgc 


cttccttgac 


cctggaaggt 


4920 


gccactccca 


ctgtcctttc 


ctaataaaat 


gaggaaattg 


catcgcattg 


tctgagtagg 


4980 


tgtcattcta 


ttctgggggg 


tggggtgggg 


caggacagca 


agggggagga 


ttgggaagac 


5040 


aatagcaggc 


atgctgggga 


tgcggtgggc 


tctatggctt 


ctgaggcgga 


aagaaccagg 


5100 


tgcccagtca 


tagccgaata 


gcctctccac 


ccaagcggcc 


ggagaacctg 


cgtgcaatcc 


5160 


actgggggcg 


eg 










5172 


<210> 10 
<211> 6233 
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025CIP7SEQ Li St.tXt 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PI asm! d pCR-XL-TOPO-CMV-pur-attB 



<400> 10 



agcgcccaat 


acgcaaaccg 


cctctccccg 


cgcgttggcc 


gattcattaa 


tgcagctggc 


60 


acgacaggtt 


tcccgactgg 


aaagcgggca 


gtgagcgcaa 


cgcaattaat 


gtgagttagc 


120 


tcactcatta 


ggcaccccag 


gctttacact 


ttatgcttcc 


ggctcgtatg 


ttgtgtggaa 


180 


ttgtgagcgg 


ataacaattt 


cacacaggaa 


acagctatga 


ccatgattac 


gccaagctat 


240 


ttaggtgacg 


cgttagaata 


ctcaagctat 


gcatcaagct 


tggtaccgag 


ctcggatcca 


300 


ctagtaacgg 


ccgccagtgt 


gctggaattc 


gcccttggcc 


gcaataaaat 


atctttattt 


360 


tcattacatc 


tgtgtgttgg 


ttttttgtgt 


gaatcgatag 


tactaacata 


cgctctccat 


420 


caaaacaaaa 


cgaaacaaaa 


caaactagca 


aaataggctg 


tccccagtgc 


aagtgcaggt 


480 


gccagaacat 


ttctctatcg 


ataggtaccg 


agctcttacg 


cgtgctagcc 


ctcgagcagg 


540 


atctatacat 


tgaatcaata 


ttggcaatta 


gccatattag 


tcattggtta 


tatagcataa 


600 


atcaatattg 


gctattggcc 


attgcatacg 


ttgtatctat 


atcataatat 


gtacatttat 


660 


attggctcat 


gtccaatatg 


accgccatgt 


tgacattgat 


tattgactag 


ttattaatag 


720 


taatcaatta 


cggggtcatt 


agttcatagc 


ccatatatgg 


agttccgcgt 


tacataactt 


780 


acggtaaatg 


gcccgcctgg 


ctgaccgccc 


aacgaccccc 


gcccattgac 


gtcaataatg 


840 


acgtatgttc 


ccatagtaac 


gccaataggg 


actttccatt 


gacgtcaatg 


ggtggagtat 


900 


ttacggtaaa 


ctgcccactt 


ggcagtacat 


caagtgtatc 


atatgccaag 


tccgccccct 


960 


attgacgtca 


atgacggtaa 


atggcccgcc 


tggcattatg 


cccagtacat 


gaccttacgg 


1020 


gactttccta 


cttggcagta 


catctacgta 


ttagtcatcg 


ctattaccat 


ggtgatgcgg 


1080 


ttttggcagt 


acatcaatgg 


gcgtggatag 


cggtttgact 


cacggggatt 


tccaagtctc 


1140 




d d ^ ^ 




L^v^ ^ ^ d^ d dd 


Pti~ pi pi r^rmrr^ 

d ^ d d^^ <^ ^ d 


\ dd dd 


1200 


tgtcgtaaca 


actccgcccc 


attgacgcaa 


atgggcggta 


ggcgtgtacg 


gtgggaggtc 


1260 


tatataagca 


gagctcgttt 


agtgaaccgt 


cagatcgcct 


ggagacgcca 


tccacgctgt 


1320 


tttgacctcc 


atagaagaca 


ccgggaccga 


tccagcctcc 


cctcgaagct 


cgactctagg 


1380 


ggctcgagat 


ctgcgatcta 


agtaagcttg 


catgcctgca 


ggtcggccgc 


cacgaccggt 


1440 


gccgccacca 


tcccctgacc 


cacgcccctg 


acccctcaca 


aggagacgac 


cttccatgac 


1500 


cgagtacaag 


cccacggtgc 


gcctcgccac 


ccgcgacgac 


gtcccccggg 


ccgtacgcac 


1560 


cctcgccgcc 


gcgttcgccg 


actaccccgc 


cacgcgccac 


accgtcgacc 


cggaccgcca 


1620 


catcgagcgg 


gtcaccgagc 


tgcaagaact 


cttcctcacg 


cgcgtcgggc 


tcgacatcgg 


1680 
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caaggtgtgg 


gtcgcggacg 


acggcgccgc 


025CIP7SEQ 
qqtqqcqqtc 


List.tXt 
tggaccacgc 


cggagagcgt 


1740 


cgaagcgggg 


gcggtgttcg 


ccgagatcgg 


cccgcgcatg 


gccgagttga 


gcggttcccg 


1800 


qctqqccqcq 


cagcaacaga 


tqqaaqqcct 


cctqqcqccq 


caccggccca 


aqqaqcccqc 

3 3 3 ^ ^ 3 ^ 


1860 


gtggttcctg 


gccaccgtcg 


gcgtctcgcc 


cgaccaccag 


ggcaagggtc 


tgggcagcgc 


1920 


cgtcgtgctc 


cccggagtgg 


aggcggccga 


qcqcqccqqq 


gtgcccgcct 


tcctggagac 


1980 


ctccgcgccc 


cgcaacctcc 


ccttctacga 


gcggctcggc 


ttcaccgtca 


ccgccgacgt 


2040 


cgaqqtqccc 


gaagqaccqc 


gcacctgqtq 


catgacccgc 


aagcccqqtq 

3 33 3 


cctgacgccc 


2100 


gccccacgac 


ccgcagcgcc 


cgaccgaaag 


qaqcqcacqa 


ccccatggct 


ccgaccgaag 


2160 


ccqacccqqq 


cggccccgcc 


gaccccgcac 


ccgcccccga 


ggcccaccga 


ctctagagtc 


2220 


ggggcggccg 


gccgcttcga 


gcagacatga 


taagatacat 


tgatgagttt 


ggacaaacca 


2280 


caactagaat 


gcagtgaaaa 


aaatgcttta 


tttgtgaaat 


ttgtgatgct 


attgctttat 


2340 


ttgtaaccat 


tataagctgc 


aataaacaag 


ttaacaacaa 


caattgcatt 


cattttatgt 


2400 


ttcaggttca 


gggggaggtg 


tgggaggttt 


tttaaagcaa 


gtaaaacctc 


tacaaatgtg 


2460 


gtaaaatcga 


taaggatcaa 


ttcggcttca 


ggtaccgtcg 


acgatgtagg 


tcacggtctc 


2520 


gaaqccgcgg 


tgcqqqtqcc 


agggcgtgcc 


cttgggctcc 


ccgggcgcgt 


actccacctc 


2580 


acccatctgg 


tccatcatga 


tgaacgggtc 


gaggtggcgg 


tagttgatcc 


cggcgaacgc 


2 6-10 


gcggcgcacc 


gggaagccct 


cgccctcgaa 


accgctgggc 


gcggtggtca 


cggtgagcac 


2700 


qQQacqtqcg 


acggcgtcgg 


cqqqtqcqqa 


tacgcggggc 


agcgtcagcg 


ggttctcgac 


2760 


qgtcacqqcg 


qqcatqtcqa 


cagccgaatt 


gatccgtcga 


ccgatgccct 


tgagagcctt 


2820 


caacccagtc 


agctccttcc 


ggtgggcgcg 


gggcatgact 


atcgtcgccg 


cacttatgac 


2880 


tgtcttcttt 


atcatgcaac 


tcgtaggaca 


ggtgccggca 


gcgctcttcc 


gcttcctcgc 


2940 


tcactgactc 


qctqcgctcg 


gtcgttcggc 


tgcggcgagc 

3 3 3 3 3 


ggtatcagct 


cactcaaagg 


3000 


cggtaatacg 


gttatccaca 


gaatcagggg 


ataacgcagg 


aaagaacatg 


aagggcgaat 


3050 


tctgcagata 


tccatcacac 


tggcggccgc 


tcgagcatgc 


atctagaggg 


cccaattcgc 


3120 


cctatagtga 


gtcgtattac 


aattcactgg 


ccgtcgtttt 


acaacgtcgt 


gactgggaaa 


3180 


accctggcgt 


tacccaactt 


aatcgccttg 


cagcacatcc 


ccctttcgcc 


agctqgcqta 


3240 


atagcgaaga 


ggcccgcacc 


gatcgccctt 


cccaacagtt 


gcgcagccta 


tacgtacggc 


3300 


agtttaaggt 


ttacacctat 


aaaagagaga 


gccgttatcg 


tctgtttgtg 


gatgtacaga 


3360 


gtgatattat 


tgacacgccg 


gggcgacgga 


tggtgatccc 


cctggccagt 


gcacgtctgc 


3420 


tgtcagataa 


agtctcccgt 


gaactttacc 


cggtggtgca 


tatcggggat 


gaaagctggc 


3480 


gcatgatgac 


caccgatatg 


gccagtgtgc 


cggtctccgt 


tatcggggaa 


gaagtggctg 


3540 


atctcagcca 


ccgcgaaaat 


gacatcaaaa 


acgccattaa 


cctgatgttc 


tggggaatat 


3600 
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025CIP7SEQ 


List . tXt 






aaatgtcagg 


catgagatta 


tcaaaaagga 


tcttcaccta 


gatccttttc 


acgtagaaag 


3660 


ccagtccgca 


gaaacggtgc 


tgaccccgga 


tgaatgtcag 


ct actgggct 


at ctggacaa 


3720 


gggaaaacgc 

-J -J -J -J 


aagcgcaaag 


aqaaaqcaqq 


tagcttgcag 


tgggcttaca 


tggcgatagc 


3780 


tagactgggc 


ggttttatgg 


acagcaagcg 


aaccggaatt 


gccagctggg 


gcgccctctg 


3840 


gtaaggttgg 


gaagccctgc 


aaagtaaact 


ggatggcttt 


ct cgccgcca 


aggat ctgat 


3900 


qqcgcagggq 


atcaagctct 


gatcaagaga 


caqqat qaqq 


atcgtttcgc 


atgattgaac 


3960 


aagatggatt 


gcacgcaggt 


tctccggccg 


cttqqqtqqa 


gaggctattc 


ggctatgact 


4020 


gggcacaaca 


gacaatcggc 


tgctctgatg 


ccgccgtgtt 


ccggctgtca 


qcqcaqqqqc 


4080 


gcccggttct 


ttttgtcaag 


accgacctgt 


ccggtgccct 


gaatgaactg 


ca aga cgagg 


4140 


cagcgcggct 


atcgt ggctg 


gccacgacgg 


gcgttccttg 


cgcagctgtg 


ctcgacgttg 


4200 


tcactgaagc 


qqqaaqqqac 


tggctgctat 


tgggcgaagt 


qccqqqqcaq 


gatctcctgt 


4260 


catctcacct 


tgctcctgcc 


gagaaagtat 


ccatcatggc 


tgatgcaatg 


cggcggctgc 


4320 


atacgcttga 


tccggctacc 


tgcccattcg 


accaccaagc 


gaaacatcgc 


at cgagcgag 


4380 


cacgtactcg 


gatggaagcc 


ggtcttgtcg 


atcaggatga 


tctggacgaa 


gagcatcagg 


4440 


qqctcgcqcc 


agccgaactg 


ttcgccaggc 


tcaaqqcqaq 


catgcccgac 


qqcqaqqatc 


4500 


tcgtcgtgac 


ccatggcgat 


gcctgcttgc 


cgaatatcat 


ggtggaaaat 


ggccgctttt 


4560 


ctggattcat 


cgactgtggc 


cggctgggtg 


tggcggaccg 


ct atcaggac 


atagcgttgg 


4620 


ctacccgtga 


tattgctgaa 


gagcttggcg 


gcgaatgggc 


tgaccgcttc 


ctcgtgcttt 


4680 


acggtatcgc 


cgctcccgat 


tcgcagcgca 


tcgccttcta 


tcgccttctt 


gacgagttct 


4740 


tctgaattat 


taacgcttac 


aatttcctga 


tgcggtattt 


tctccttacg 


catctgtgcg 


4800 


gtatttcaca 


ccgcatacag 


gtggcacttt 


tcggggaaat 


gtgcgcggaa 


cccctatttg 


4860 


tttatttttc 


taaatacatt 


caaatatgta 


tccgctcatg 


agacaataac 


cctgataaat 


4920 


gcttcaataa 


tagcacgtga 


ggagggccac 


catggccaag 


ttgaccagtg 


ccgttccggt 


4980 


gctcaccgcg 


cgcgacgtcg 


ccggagcggt 


cgagttctgg 


accgaccggc 


tcgggttctc 


5040 


ccgggacttc 


gtggaggacg 


acttcgccgg 


tgtggtccgg 


gacgacgtga 


ccctgttcat 


5100 


cagcgcggtc 


caggaccagg 


tggtgccgga 


caacaccctg 


gcctgggtgt 


qqqtqcqcqq 


5160 


cctggacgag 


ctgtacgccg 


agt ggt cgga 


ggt cgt gt cc 


acgaacttcc 


gggacgcctc 


c 0 0 


cgggccggcc 


atgaccgaga 


tcggcgagca 


gccgtggggg 


cgggagttcg 


ccctgcgcga 


5280 


cccggccggc 


aactgcgtgc 


acttcgtggc 


cgaggagcag 


gactgacacg 


tgctaaaact 


5340 


tcatttttaa 


tttaaaagga 


tctaggtgaa 


gatccttttt 


gataatctca 


tgaccaaaat 


5400 


cccttaacgt 


gagttttcgt 


tccactgagc 


gtcagacccc 


gt agaaaaga 


tcaaaggatc 


5460 
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ttcttgagat 


cctttttttc 


tgcgcgtaat 


025CIP7SEQ 
ctgctgcttg 


List.tXt 
caaacaaaaa 


aaccaccgct 


5520 


accagcggtg 


gtttgtttgc 


cggatcaaga 


gctaccaact 


ctttttccga 


aggtaactgg 


5580 


cttcagcaga 


gcgcagatac 


caaatactgt 


ccttctagtg 


tagccgtagt 


taggccacca 


5640 


cttcaagaac 


tctgtagcac 


cgcctacata 


cctcgctctg 


ctaatcctgt 


taccagtggc 


5700 


tgctgccagt 


ggcgataagt 


cgtgtcttac 


cgggttggac 


tcaagacgat 


agttaccgga 


5760 


taaggcgcag 


cggtcgggct 


gaacgggggg 


ttcgtgcaca 


cagcccagct 


tggagcgaac 


5820 


gacctacacc 


gaactgagat 


acctacagcg 


tgagctatga 


gaaagcgcca 


cgcttcccga 


5880 


agggagaaag 


gcggacaggt 


atccggtaag 


cggcagggtc 


ggaacaggag 


agcgcacgag 


5940 


ggagcttcca 


gggggaaacg 


cctggtatct 


ttatagtcct 


gtcgggtttc 


gccacctctg 


6000 


acttgagcgt 


cgatttttgt 


gatgctcgtc 


aggggggcgg 


agcctatgga 


aaaacgccag 


6060 


caacgcggcc 


tttttacggt 


tcctgggctt 


ttgctggcct 


tttgctcaca 


tgttctttcc 


6120 


tgcgttatcc 


cctgattctg 


tggataaccg 


tattaccgcc 


tttgagtgag 


ctgataccgc 


6180 


tcgccgcagc 


cgaacgaccg 


agcgcagcga 


gtcagtgagc 


gaggaagcgg 


aag 


6233 



<210> 11 

<211> 234 

< 2 1 2 > DNA 

<213> Artificial sequence 
<220> 

<223> attP containing polynucleotide 

<400> 11 



gactagtact 


gacggacaca 


ccgaagcccc 


ggcggcaacc 


ctcagcggat 


gccccggggc 


60 


ttcacgtttt 


cccaggtcag 


aagcggtttt 


cgggagtagt 


gccccaactg 


gggtaacctt 


120 


tgagttctct 


cagttggggg 


cgtagggtcg 


ccgacatgac 


acaaggggtt 


gtgaccgggg 


180 


tggacacgta 


cgcgggtgct 


tacgaccgtc 


agtcgcgcga 


gcgcgactag 


taca 


234 



<210> 12 

<211> 26 

< 2 1 2 > DNA 

<213> Artificial Sequence 

<220> 

<223> Primer attB-for 

<400> 12 

taccgtcgac gatgtaggtc acggtc 2 6 



<210> 13 

<211> 11 

<212> PRT 

<213> SV40 



<400> 13 
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025CIP7SEQ List.tXt 
cys Gly Gly Pro Lys Lys Lys Arg Lys Val Gly 
1 5 10 



<210> 14 

<211> 237 

<212> DNA 

<213> mouse 



<400> 14 
gacctggaat 


atcgcgagta 


aactgaaaat 


cacggaaaat 


gagaaataca 


cactttagga 


60 


cgtgaaatat 


ggcgaggaaa 


actgaaaaag 


gtggaaaatt 


tagaaatgtc 


cactgtagga 


120 


cgtggaatat 


ggcaagaaaa 


ctgaaaatca 


tggaaaatga 


gaaacatcca 


cttgacgact 


180 


tgaaaaatga 


cgaaatcact 


aaaaaacgtg 


aaaaatgaga 


aatgcacact 


gaaggac 


237 


<210> 15 
<211> 275 
<212> DNA 
<213> mouse 












<400> 15 
gagtgagtta 


cactgaaaaa 


cacatacgtt 


ggaaaccggc 


attgtagaac 


agtgtatatc 


60 


aatgagttac 


aatgagaaaa 


atggaaaatg 


ataaaaacca 


cagtgtagaa 


catattagat 


120 


gtgtgagtta 


cactgaaaaa 


cacattcctt 


ggaaacggga 


tttgtagaac 


tgtgtatatc 


180 


aatgagttac 


aatgagaaac 


atggaaaatg 


ataaaaacca 


cactgtagaa 


cattttagat 


240 


gagtgagtta 


cactgaaaaa 


cacatatgtt 


ggaaa 






275 


<210> 16 
<211> 9299 
<212> DNA 
<213> mouse 












< 4 0 0 > 16 
ggccgctctt 


ctcgttctgc 


cagcgggccc 


tcgtctctcc 


accccatccg 


tctgccggtg 


60 


gtgtgtggaa 


ggcaggggtg 


cggctctccg 


gcccgacgct 


gccccgcgcg 


cacttttctc 


120 


agtggttcgc 


gtggtccttg 


tggatgtgtg 


aggcgcccgg 


ttgtgccctc 


acgtgtttca 


180 


ctttggtcgt 


gtctcgcttg 


accatgttcc 


cagagtcggt 


ggatgtggcc 


ggtggcgttg 


240 


catacccttc 


ccgtctggtg 


tgtgcacgcg 


ctgtttcttg 


taagcgtcga 


ggtgctcctg 


300 


gagcgttcca 


ggtttgtctc 


ctaggtgcct 


gcttctgagc 


tggtggtggc 


gctccccatt 


360 


ccctggtgtg 


cctccggtgc 


tccgtctggc 


tgtgtgcctt 


cccgtttgtg 


tctgagaagc 


420 


ccgtgagagg 


ggggtcgagg 


agagaaggag 


gggcaagacc 


ccccttcttc 


gtcgggtgag 


480 


gcgcccaccc 


cgcgactagt 


acgcctgtgc 


gtagggctgg 


tgctgagcgg 


tcgcggctgg 


540 


ggttggaaag 


tttctcgaga 


gactcattgc 


tttcccgtgg 


ggagctttga 


gaggcctggc 


600 


tttcgggggg 


gaccggttgc 


agggtctccc 


ctgtccgcgg 
Page 


atgctcagaa 
46 


tgcccttgga 


660 
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025CIP7SEQ 


List . txt 






agagaacctt 


cctgttgccg 


cagacccccc 


cgcgcggt eg 


cccgcgtgtt 


ggtcttctgg 


720 


tttccctgtg 


tgctcgtcgc 


atgcatcctc 


tctcggtggc 


cggggct cgt 


cggggttttg 


780 


ggtccgtccc 


gccctcagtg 


agaaagtttc 


cttctctagc 


tatcttccgg 


aaagggtgcg 


840 


ggcttcttac 


ggtctcgagg 


ggtctctccc 


gaatggtccc 


ctggagggct 


cgccccctga 


900 


ccgcct cccg 


cgcgcgcagc 


gtttgctctc 


tcgtctaccg 


cggc ccgcgg 


cctccccgct 


960 


ccgagttcgg 


ggagggatca 


cqcqqqqcaq 


agcctgtctg 


tcgtcctgcc 


gttgctgcgg 


1020 


agcatqtqgc 


tcqqcttqtq 


tqqttqqt qq 


ctqqqqaqaq 


ggctccgtgc 


acacccccgc 


1080 


gtgcgcgtac 


tttcctcccc 


tcctgagggc 


cgccgtgcgg 


acqggqtgtq 

33 33 3 3 


ggtaggcgac 


1140 


ggtgggctcc 


cgggtcccca 


cccgtcttcc 


cgtgcctcac 


ccgtgccttc 


cgtcgcgtgc 


1200 


gtccctctcg 


ctcgcgtcca 


cgactttggc 


cgctcccgcg 


acggcggcct 


gcgccgcgcg 


1260 


tggtgcgtgc 


tgtgtgcttc 


tcgggctgtg 


tggttgtgtc 


gcctcgcccc 


ccccttcccg 


1320 


cggcagcgtt 


cccacggctg 


gcgaaatcgc 


gggagtcctc 


cttcccctcc 


tcggggtcga 


1380 


gagggtccgt 


gtctggcgtt 


gattgatctc 


gctctcgggg 


acgggaccgt 


tctgtgggag 


1440 


aacggctgtt 


ggccgcgtcc 


ggcgcgacgt 


cqqacqtqqq 

z) z) z) z) z) z) 


gacccactgc 


cqct cqqqqq 

— z) • ^ ZJ ZJ ZJ ZJ ZJ 


1500 


tcttcgtcgg 


taqqcatcqq 

33^ ^33 


tqtqtcqqca 

3 3 ^33 


tcggtctctc 


tctcgtgtcg 


gtgtcgcctc 


1560 


ctcgggctcc 


cggggggccq 


tcgtgtttcg 


ggtcggct eg 


gcgctgcagg 


tqtggtggqa 


1620 


ct get c aggg 


gagtggtgca 


gtgtgattcc 


cgccggtttt 


gcctcgcgtg 


ccctgaccgg 


1680 


tccgacgccc 


gagcggtctc 


tcggtccctt 


gtgaggaccc 


ccttccggga 


ggggcccgtt 


1740 


tcggccgccc 


ttgccgtcgt 


cgccggccct 


cgttctgctg 


tgtcgttccc 


ccctccccgc 


1800 


tcgccgcagc 


cggtcttttt 


tcctctctcc 


ccccctctcc 


tctgactgac 


ccgt ggccgt 


1860 


qctgtcggac 


cccccgcatg 


qqqqcqqccq 


qqcacqtacq 

3 3 3 3 


cqtccqqqcq 

3 333 3 


qtcaccqqqq 


1920 


tcttqggqgg 


qqqccqaqqq 


gtaagaaagt 


cggctcggcg 


qqcqqqaqqa 


gctgtggttt 


1980 


qqagqqcqtc 


ccggccccgc 


qqccqtqqcq 

_i'3 3 33 3 


qtqtcttqcq 

3 3 3 3 


cqqtcttqqa 

33 33 


qaqqqctqcq 


2040 


tcrcQacrcrQcra 

3 3 3 333 


aaaggttgcc 




aaaQQQaaacr 


acTQctacrcaQ 


tggtcattgt 


2100 


cccgacggtg 

3 3 3 3 


tggtggtctg 


ttggccgagg 

3 3 3 3 3 


tgcgtctggg 


gggctcgtcc 

3 3 3 3^ 


ggccctgtcg 

3 3 ****** ^ *^ ^ 


2160 


tccgtcggga 


aggcgcgtgt 


tggggcctgc 


cggagtgccg 


aggtgggtac 


cctggcggtg 


2220 


ggattaaccc 


cgcgcgcgtg 


tcccggtgtg 


gcggtggggg 


ctccggtcga 


tgtctacctc 


2280 


cctctccccg 


aggtctcagg 


ccttctccgc 


gcgggctctc 


ggccctcccc 


tcgttcctcc 


2340 


ctctcgcggg 


gttcaagtcg 


ctcgtcgacc 


tcccctcctc 


cgtccttcca 


tctctcgcgc 


2400 


aatggcgccg 


cccgagttca 


cggtgggttc 


gtcctccgcc 


tccgcttctc 


gccgggggct 


2460 


ggccgctgtc 


cggtctctcc 


tgcccgaccc 


ccgttggcgt 


ggtcttctct 


cgccggcttc 


2520 
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gcggactcct 


ggcttcgccc 


ggagggtcag 


025CIP7SEQ 
ggggcttccc 


List .txt 
ggttccccga 


cgttgcgcct 


2580 


cgctgctgtg 


tgcttggggg 


gggcccgctg 


cggcctccgc 


ccgcccgtga 


gcccctgccg 


2640 


cacccgccgg 


tgtgcggttt 


cgcgccgcgg 


tcagttgggc 


cctggcgttg 


tgtcgcgtcg 


2700 


ggagcgtgtc 


cgcctcgcgg 


cggctagacg 


cgggtgtcgc 


cgggctccga 


cgggtggcct 


2760 


atccagggct 


cgcccccgcc 


gacccccgcc 


tgcccgtccc 


ggtggtggtc 


gttggtgtgg 


2820 


ggagtgaatg 


gtgctaccgg 


tcattccctc 


ccgcgtggtt 


tgactgtctc 


gccggtgtcg 


2880 


cgcttctctt 


tccgccaacc 


cccacgccaa 


cccaccaccc 


tgctctcccg 


gcccggtgcg 


2940 


gtcgacgttc 


cggctctccc 


gatgccgagg 


ggttcgggat 


ttgtgccggg 


gacggagggg 


3000 


agagcgggta 


agagaggtgt 


cggagagctg 


tcccggggcg 


acgctcgggt 


tggctttgcc 


3060 


gcgtgcgtgt 


gctcgcggac 


gggttttgtc 


ggaccccgac 


ggggtcggtc 


cggccgcatg 


3120 


cactctcccg 


ttccgcgcga 


gcgcccgccc 


ggctcacccc 


cggtttgtcc 


tcccgcgagg 


3180 


ctctccgccg 


ccgccgcctc 


ctcctcctct 


ctcgcgctct 


ctgtcccgcc 


tggtcctgtc 


3240 


ccacccccga 


cgctccgctc 


gcgcttcctt 


acctggttga 


tcctgccagg 


tagcatatgc 


3300 


ttgtctcaaa 


gattaagcca 


tgcatgtcta 


agtacgcacg 


gccggtacag 


tgaaactgcg 


3360 


aatggctcat 


taaatcagtt 


atggttcctt 


tggtcgctcg 


ctcctctcct 


acttggataa 


3420 


ctgtggtaat 


tctagagcta 


atacatgccg 


acgggcgctg 


accccccttc 


ccgggggggg 


3480 


atgcgtgcat 


ttatcagatc 


aaaaccaacc 


cggtgagctc 


cctcccggct 


ccggccgggg 


3540 


y l-cyyy cgcc; 


yy cgyc L Lyy 


Ly dc Lc Lay d 


Lddc;ct:cggg 


ccgdLc;y(jti(; 


gt;cuuci;y Ly 


3600 


gcggcgacga 


cccattcgaa 


cgtctgccct 


atcaactttc 


gatggtagtc 


gccgtgccta 


3560 


ccatggtgac 


cacgggtgac 


ggggaatcag 


ggttcgattc 


cggagaggga 


gcctgagaaa 


3720 


cggctaccac 


atccaaggaa 


ggcagcaggc 


gcgcaaatta 


cccactcccg 


acccggggag 


3780 


gtagtgacga 


aaaataacaa 


tacaggactc 


tttcgaggcc 


ctgtaattgg 


aatgagtcca 


3840 


ctttaaatcc 


tttaacgagg 


atccattgga 


gggcaagtct 


ggtgccagca 


gccgcggtaa 


3900 


ttccagctcc 


aatagcgtat 


attaaagttg 


ctgcagttaa 


aaagctcgta 


gttggatctt 


3960 


gggagcgggc 


gggcggtccg 


ccgcgaggcg 


agtcaccgcc 


cgtccccgcc 


ccttgcctct 


4020 


cggcgccccc 


tcgatgctct 


tagctgagtg 


tcccgcgggg 


cccgaagcgt 


ttactttgaa 


4080 


aaaattagag 


tgttcaaagc 


aggcccgagc 


cgcctggat a 


ccgcagctag 


gaataatgga 




ataggaccgc 


ggttctattt 


tgttggtttt 


cggaactgag 


gccatgatta 


agagggacgg 


4200 


ccgggggcat 


tcgtattgcg 


ccgctagagg 


tgaaattctt 


ggaccggcgc 


aagacggacc 


4260 


agagcgaaag 


catttgccaa 


gaatgttttc 


attaatcaag 


aacgaaagtc 


ggaggttcga 


4320 


agacgatcag 


ataccgtcgt 


agttccgacc 


ataaacgatg 


ccgactggcg 


atgcggcggc 


4380 


gttattccca 


tgacccgccg 


ggcagcttcc 


gggaaaccaa 


agtctttggg 


ttccgggggg 


4440 
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0 2 5CIP'/SEQ 


List . tXt 






agtatggttg 


caaagctgaa 


acttaaagga 


attgacqqaa 


qggcaccacc 


aggagt gggc 


4500 


ctgcggctta 


atttgactca 


acacgggaaa 


cct cac ccgg 


cc cggacacg 


gacaggattg 


4560 


acagattgat 


agctctttct 


cgattccgtg 


qgt qgt gat g 


catggccgtt 


cttagttggt 


4620 


ggagcgattt 


gtctggttaa 


ttccgataac 


gaacgagact 


ctggcatgct 


aactagttac 


4680 


gcgacccccg 


acfcqqt cqqc 

J) J) J) J) J) 


gt cccccaac 


ttcttagagg 


qacaaqtqqc 

zy " ^ ZJ z> ZJ 


gttcagccac 


4740 


ccgagattga 


gcaataacag 


gtctgtgatg 


cccttagatg 


tccggggctg 


cacgcgcgct 


4800 


acactgactg 


qctcaqcqtq 


tgcctaccct 


qcqccqqcaq 

z} ZJ ^ ZJ ZJ 


gcgcgggtaa 


cccgttgaac 


4860 


cccattcgtg 


atggggatcg 


gggattgcaa 


ttattcccca 


tgaacgagga 


attcccagta 


4920 


agtgcgggt c 


ataagcttgc 


gttgattaag 


tccctgccct 


ttgtacacac 


cgcccgt cgc 


4980 


tactaccgat 


tggatggttt 


agtgaggccc 


tcggatcggc 


cccgccgggg 


tcggcccacg 


5040 


gccctggcgg 


agcgct gaga 


agacggtcga 


acttgactat 


ct agaggaag 


taaaagtcgt 


5100 


aacaaggttt 


ccgtaggtga 


acctgcggaa 


ggatcattaa 


acgggagact 


qt qqaqqaqc 


5160 


CTQccTcrccrtcrcr 


cccgctctcc 


ccgtcttgtg 


tgtgtcctcg 


ccqqqaqqcq 


cgtgcgtccc 


5220 


gggtcccgtc 


qcccqcqtqt 

J _J 3 3 


qqaqcqaqqt 


qtctqqaqtq 


aqqtqaqaqa 


aqqqqtqqqt 


5280 


gqggt cggt c 


tgggtccgtc 


tgggaccgcc 


tccgatttcc 


cctccccctc 


ccctctccct 


5340 


cgtccggctc 


tgacct cgcc 


accctaccgc 


CfCTCCrCT C CfCfC t 


gctcgcgggc 


gtcttgcctc 


5400 


tttcccgtcc 


ggctcttccg 


tgtctacgag 


QCTQCQcrtaccr 


tcgttacggg 


tttttgaccc 


5460 


gtcccqggqq 


cgttcggtcg 


tcqqqqcqcq 


cgctttgctc 


tcccggcacc 


catccccgcc 


5520 


gcggctctgg 


cttttctacg 


1 1 qqct qqqq 


cggttgtcgc 


qt qt qqqqqq 


atgtgagtgt 


5580 


cqcqtqtqqq 


ct cgcccgt c 


ccgatgccac 


gcttttctgg 


cctcgcgtgt 


cctccccgct 


5640 


cctgtcccgg 


gtacctagct 


gtcgcgttcc 


qqcqcqqaqq 


tttaaggacc 


ccqqqqqqqt 


5700 


cgccctgccg 


cccccagggt 


cqqqqqqcqq 

^^^^^^ ^ ^ ^ 


tqqqqcccqt 

Z? ZJ ZJ ZJ ZJ 


aqqqaaqtcq 


qtcqttcqqq 


5760 


cggctctccc 


tcagactcca 


tgaccctcct 


ccccccgctg 


ccgccgttcc 


cqaqqcqqcq 


5820 


gtcqtqtqqq 

3 l3 l5 ^333 


qqqqtqqatq 


tctggagccc 


cctcgggcgc 


cqtqqqqqcc 


cgacccgcgc 


5880 


cgccgqcttg 


cccgatttcc 


gcqqqt cqgt 

-w- ^^^^^^^ 


cctgtcqqtq 

^ ^^ ^ 


ccqqtcqtqq 

^^ ^ ^^ 


gttcccgtgt 


5940 


cgttcccgtg 


tttttccgct 


cccgaccctt 


tttttttcct 


cccccccaca 


cgtgtctcgt 


6000 


ttcgttcctg 


ctggccggcc 


tgaggctacc 


cctcgqtcca 


tctgttctcc 


tctctctccg 


b Ub U 


gggagaggag 


ggcggtggtc 


gttgggggac 


tgtgccgtcg 


tcagcacccg 


tgagttcgct 


6120 


cacacccgaa 


ataccgatac 


gactcttagc 


ggtggatcac 


tcggctcgtg 


cgtcgatgaa 


6180 


gaacgcagct 


agctgcgaga 


attaatgtga 


attgcaggac 


acattgatca 


tcgacacttc 


6240 


gaacgcactt 


gcggccccgg 


gttcctcccg 


gggctacgcc 


tgtctgagcg 


tcggttgacg 


6300 
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ggagtttgct 


cgcagggcca 


6360 


accccccaac 


ccgggtcggg 


ccctccgtct 


cccgaagttc 


agacgtgtgg 


gcggttgtcg 


6420 


qt qt qqcqcq 


cgcgcccgcg 


tcqcqqaqcc 


tggtctcccc 


cgcgcatccg 


cqctcqcqqc 


6480 


ttcttcccgc 


tccgccgttc 


ccgccctcgc 


ccgtgcaccc 


cggtcctggc 


ctcgcgtcgg 


6540 


cgcctcccgg 


accgctgcct 


caccagtctt 


tctcggtccc 


gtgccccgtg 


ggaacccacc 


6600 


gcgcccccgt 


ggcgcccggg 


ggtgggcgcg 


tccgcatctg 


ctctggtcga 


ggttggcggt 

3 3 3 3 3 3 


6660 


tgagggtgtg 


cgtgcgccga 


ggtggtggtc 


ggtcccctgc 


ggccgcgggg 


ttgtcggggt 


6720 


ggcggtcgac 


gaggqccgqt 


cggtcgcctg 


cggtggttgt 


ctgtgtgtgt 


ttgggtcttg 


6780 


cqctqqqqqa 




accgctcgcg 


qqqttqqcqc 


ggtcgcccgg 


cgccgcgcac 


6840 


cctccggctt 




aqaqcgaqgq 

3 3 3 333 


cgagaacgga 


gaqagqtgqt 

^ ^ ^ ^ 33 


atccccggtg 


6900 


gcgttgcgag 


qqaqqqtttq 

^^ ^^^ WWk^ 


gcgtcccgcg 


tccgtccgtc 


cctccctccc 


tcqqt qqgcq 

33 Z3 3 3 3 


6960 


ccttcgcgcc 


gcacgcggcc 


gctaqgqgcq 


gtcggggccc 


gtggcccccg 


tggctcttct 


7020 


tcgtctccgc 


ttctccttca 


cccgggcggt 


acccgctccg 


gcgccggccc 


gcgggacgcc 


7080 


gcggcgtccg 


tgcgccgatg 


cgagtcaccc 


ccgggt gttg 


cgagttcggg 


gagggagagg 


7140 


gcct cgctga 


cccgttgcgt 


cccggcttcc 


ctggggggga 

T7333z733 


cccggcgtct 


qtgggctqtq 


7200 


cgtcccgggg 


gttgcgtgtg 


agtaagatcc 


tccacccccg 


ccgccctccc 


ctcccgccgg 


7260 


cctctcgggg 


accccctgag 


acggttcgcc 


ggctcgtcct 


cccgtgccgc 


cgggtgccgt 


7320 


ctctttcccg 


cccgcct cct 


cgctctcttc 


ttcccgcggc 


tqqqcqcqtq 
^^^^^^ 


tccccccttt 


7380 


ctgaccgcga 


cctcagatca 


gacgtggcga 


cccgctgaat 


ttaagcatat 


tagtcagcgg 


7440 


aggaaaagaa 


actaaccagg 


att ccct cag 


t aa cqqcqa q 


tgaacaggga 


agagcccagc 


7500 


gccgaat ccc 


cgccgcgcgt 


cqcQQcqtaq 


gaaatgtggc 


gtacggaaga 


cccactcccc 


7560 


ggcgccgctc 


gtggggggcc 

^ ^^^ 


caagtccttc 


tgatcgaggc 


ccagcccgtg 


gacggtgtga 

— ' _J 3 3 3 


7620 


ggccggtagc 


ggccccggcg 


cgccgggctc 


gggtcttccc 


ggagtcgggt 


tgcttgggaa 


7680 


tgcagcccaa 


aqcgqqtgqt 


aaactccatc 


taaggctaaa 


taccggcacg 


agaccgatag 


7740 


tcaacaagta 


ccqtaaggqa 


aagttgaaaa 


gaactttgaa 


qagagaqttc 

3 3 3 3 — — — 


aagaggqcgt 

3 3 3 3 3 


7800 


gaaaccgtta 


agaggtaaac 


qqgtqqgqtc 

333 3333 


cgcgcagtcc 


qcccqgaqga 

3 3 3 3 3 


ttcaacccgg 


7860 


cggcgcgcgt 


ccggccgtgc 


ccggtggtcc 


cggcggatct 


ttcccgctcc 


ccgttcctcc 


7920 


cgacccctcc 


acccgcgcgt 


cgttcccctc 


ttcctccccg 


cgtccggcgc 


ctccggcggc 


7980 


gggcgcgggg 


ggtggtgtgg 


tggtggcgcg 


cgggcggggc 


cgggggtggg 


gtcggcgggg 


8040 


gaccgccccc 


ggccggcgac 


cggccgccgc 


cgggcgcact 


tccaccgtgg 


cggtgcgccg 


8100 


cgaccggctc 


cgggacggcc 


gggaaggccc 


ggtggggaag 


gtggctcggg 


gggggcggcg 


8160 


cgtctcaggg 


cgcgccgaac 


cacctcaccc 


cgagtgttac 


agccctccgg 


ccgcgctttc 


8220 
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gccgaatccc 


ggggccgagg 


aagccagata 


cccgtcgccg 


cgctctccct 


ctccccccgt 


8280 


ccgcctcccg 


ggcgggcgtg 


cr cr cr a t cr cr cr cr cr 


ccgggccgcc 


cctcccacgg 


cgcgaccgct 


8340 


ctcccacccc 


cctccgtcgc 


ctctctcggg 


gcccggtggg 


cf cr cf c cr cr cf cf c cf 


gactgtcccc 


8400 


agtgcgcccc 


gggcgtcgtc 


gcgccgtcgg 


gtcccggggg 


gaccgtcggt 


cacgcgtctc 


845 0 


ccgacgaagc 


cgagcgcacg 


gggtcggcgg 


cgatgtcggc 


tacccacccg 


acccgtcttg 


8520 


aaacacggac 


caaggagtct 


aacgcgtgcg 


cgagtcaggg 


gctcgtccga 


aagccgccgt 


8580 


ggcgcaatga 


aggtgaaggg 


ccccgcccgg 


gggcccgagg 


tgggatcccg 


aggcctctcc 


8640 


agtccgccga 


gggcgcacca 


ccggcccgtc 


tcgcccgccg 


CCTCcggagag 


gtggagcacg 


8700 


agcgtacgcg 


ttaggacccg 


aaagatggtg 


aactatgctt 




aagccagagg 


8760 


aaactctggt 


ggaggtccgt 


agcggtcctg 


acgtgcaaat 


cggtcgtccg 


acctgggtat 


8820 


aggggcgaaa 


gactaatcga 


accatctagt 


agctggttcc 


ctccgaagtt 


tccctcagga 


8880 


tagctggcgc 


tctcgctccc 


gacgtacgca 


gttttatccg 


gtaaagcgaa 


tgattagagg 


8940 


tcttggggcc 


gaaacgatct 


caacctattc 


tcaaacttta 


aatgggtaag 


aagcccggct 


9000 


cgctggcgtg 


gagccgggcg 


tggaatgcga 


gtgcctagtg 


ggccactttt 


ggtaagcaga 


9060 


actggcgctg 


cgggatgaac 


cgaacgccgg 


gttaaggcgc 


ccgatgccga 


cgctcatcag 


9120 


accccagaaa 


aggtgttggt 


tgatatagac 


agcaggacgg 


tggccatgga 


agtcggaatc 


9180 


cgctaaggag 


tgtgtaacaa 


ctcacctgcc 


gaatcaacta 


gccctgaaaa 


tggatggcgc 


9240 


tggagcgtcg 


ggcccatacc 


cggccgtcgc 


cgcagtcgga 


acggaacggg 


acgggagcg 


9299 
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